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SYSTEMATIC CHANGES IN WORD ASSOCIATION NORMS: 
1910-1952! 


JAMES J. JENKINS 
Center for Advanced Study in the Behavioral Sciences 


anD WALLACE A. RUSSELL 
University of Minnesota 


NORM may be defined as a standard 
A and a standard, as everyone knows, is 
an unchanging rule to which appeal 
may be made for precise measurement. Per- 
haps it is this verbal legerdemain which leads 
psychologists to overlook the fact that norms 
which are based on consensus or popularity of 
responses are especially susceptible to change 
and require frequent checking in our rapidly 
changing society. 

Word association norms, an example par 
excellence of the consensus norms, have been 
commonly assumed to be stable and highly 
consistent from one time to another. Indeed, 
investigators rarely raise the issue of the pos- 
sibility of normative changes but concentrate 
their attention on differences between groups 
already known to be disparate. Thus, in addi- 
tion to the very numerous attempts to com- 
pare and contrast clinical groups, many studies 
have been directed toward comparisons of 
responses of normal individuals known to 
differ in some respect. Esper (1918), for 
example, compared the most frequent re- 
sponses of American and German students to 
a standard list of stimulus words and Rosen- 
zweig (1957) contrasted the responses of 
American and French students; children’s 
responses have been contrasted with adult’s 
responses (Rosanoff & Rosanoff, 1913; Wood- 
row & Lowell, 1916); the responses of men 
have been contrasted with those of women 


1 This study is part of a program of research in 
verbal behavior being conducted at the University of 
Minnesota. This study was sponsored by the Office of 
Naval Research (Contract N 8-onr-66216) under its 
policy of encouraging basic research. Research assist- 
ants who were especially concerned with the material 
were Evalyn Segal, Walter Mink, and Rochelle Johnson. 
The Graduate School of the University of Minnesota 
also assisted with research funds. The senior author is 
happy to acknowledge his indebtedness to the Social 
Science Research Council for a Faculty Research Fel- 
lowship during the period in which the analysis was 
conducted. 


(Terman & Miles, 1936; Goodenough, 1942); 
and the responses of-one preprofessional group 
have been compared with those of another 
(Foley & MacMillan, 1943). 

While these studies have been both interest- 
ing and informative, it is illustrative of psy- 
chologists’ blindness to the normative problems 
resulting from social change, that little atten- 
tion has been given to detailed comparisons 
of even roughly matched groups tested in 
different years widely separated in time. As 
far as we have been able to discover only three 
investigations have mentioned the question. 

O’Connor (1934) in the course of an exten- 
sive item analysis of the Kent-Rosanoff test 
data he had collected on a sample of “male 
factory workers” briefly compared the popular 
responses to those of the “mixed adult sample” 
of Kent and Rosanoff (1910). However, he 
did not pursue the matter further than 
remarking: “The common response to a 
stimulus word is, therefore, a scientific reality 
in the sense that it is rediscoverable by differ- 
ent workers in a new laboratory after a lapse 
of 15 years,” and “the identity of a common 
response is more reproducible than its fre- 
quency of appearance” (O’Connor, 1934, pp. 
200-201). Recently, Tresselt and Leeds (1955) 
compared word association responses collected 
in 1952 on a small sample, heterogeneous with 
respect to residence and occupation with “the 
original norm on the Kent-Rosanoff estab- 
lished in 1927” (actually the 1910 norms). 
They found a marked increase in primary re- 
sponses which were responses of opposites 
(Dark-Light, Black-White) and coexisting 
pairs (King-Queen, Salt-Pepper). Finally, 
Dérken (1956) studied 10 high frequency 
stimulus-response pairs from the Kent- 
Rosanoff list in various normative collections 
and discovered that the frequency of these re- 
sponses seemed to be increasing steadily since 
1910. 
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Apparently, though these writers suggest 
that changes are underway, a detailed study of 
changes in word association responses over 
time remained to be done. 

A few years ago the writers fortuitously dis- 
covered that the 25-year-old word association 
norms for students at the University of Min- 
nesota did not adequately represent the asso- 
ciative habits of the present student popula- 
tion. To facilitate the derivation of functional 
relationships in another problem (Jenkins, 
Mink, & Russell, 1958) new norms were 
gathered for the Kent-Rosanoff Word Associa- 
tion Test (K-R) for students in introductory 
psychology at this university. As these norms 
were studied and compared with the earlier 
Minnesota norms collected by Schellenberg 
(1930), it became obvious that the changes 
were not minor, trivial adjustments of a few 
response frequencies (as we had first suspected) 
but represented a fairly major and systematic 
change in the norms. Further work suggested 
that the change was not a peculiarity of sam- 
pling or technique but a phenomenon of such 
scope that a comparison with more diverse 
populations was indicated. Accordingly, other 
major American normative collections were 
studied. 

This paper presents first the comparison be- 
tween the early and later Minnesota norms 
and then goes on to report the major findings 
of the comparison of these two sets of norms 
with the other large-scale normative collec- 
tions we have been able to discover. The pur- 
pose of the study is to attempt to make clear 
both the magnitude and the nature of the 
changes revealed by the comparisons of word 
association norms and to argue that these 
changes reveal a systematic and important 
trend in responding to free association tests, 
which deserves further careful consideration 
by students of verbal behavior, language and 
culture. 


CHARACTERISTICS OF THE FivE Norms 


We shall set down at once what is known 
concerning date of collection, characteristics 
of samples, and conditions of testing for each 
of the norms. All five studies employed the 
same stimulus list and presumably used the 
same order of presentation. 
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Norm 1910: Norms collected by Kent and Rosanoff 
(1910); published in 1910; collection dates not 
reported. 

Testing: Oral form; individual test. 

Sample: 1000 “mixed adults.” 

Among these subjects were persons of both sexes 
and of ages ranging from eight years to over eighty 
years, persons following different occupations, 
possessing various degrees of mental capacity and 
education, and living in widely separated localities. 
Many were from Ireland and some of these had but 
recently arrived in this country; others were from 
different parts of Europe, but all were able to speak 
English with at least fair fluency. Over two hwadred 
of the subjects, including a few university professors 
and other highly practiced cbservers, were profes- 
sional men and women or college students. About 
five hundred were employed in one or another of the 
New York State hospitals for the insane, either as 
nurses and attendants or as workers at various trades, 
the majority of these were persons of common school 
education, but the group includes also, on the one 
hand a considerable number of high school graduates; 
and on the other hand, a few laborers who were al- 
most or wholly illiterate. Nearly one hundred and 
fifty of the subjects were boys and girls of high school 
age, pupils of the Ethical Culture School, New York 
City. The remaining subjects form a miscellaneous 
group, consisting largely of clerks and farmers (pp. 
38-39). 


Norm 1925: Norms collected by O’Connor (1928, 
1934); published in 1928; collected in 1925. 
Testing: Presumably oral form, individual test; no 
details are given. 
Sample: “1000 adult male factory workers”; no fur- 
ther description of sample. 


Norm 1927: Norms collected by Schellenberg (1930) ; 
published in 1930, collected in fall, 1927. 

Testing: Group test, printed form, written answers; 
instructions suggest time pressure. Other stimulus 
words were employed following the K-R list. 

Sample: 925 students entering the University of 
Minnesota. Seventy-nine per cent were freshmen; 21% 
advanced students. Fifty-seven per cent men; 43% 
women. 


Norm 1933: Norms collected by Keene (1951); pub- 
lished in 1951; collected in academic year 1933-34. 
Testing: Oral form; individual test. Other stimulus 
words were employed following the K-R list. 
Sample: 

Five hundred Stanford students, 276 men and 224 
women ... The group included individuals from all 
four undergraduate levels and also included some 
graduate students. They were students from the 
various psychology classes and their friends (p. 43). 


Norm 1952: Norms collected by Russell and Jenkins 
(1954); published in 1954; collected in academic 
year 1952-53. 

Testing: Group test, printed form, written answers. 
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Instructions suggest a time pressure. (Same instruction 
with regard to time as Norm 1927.) 

Sample: 1,008 students in introductory psychology 
at the University of Minnesota. Largely sophomore 
population. About 60% male, 40% femaie. 


COMPARISON OF THE 1927 AND 
1952 Minnesota Norms 


Since the comparison of old and new Min- 
nesota norms stimulated the remainder of the 
study and since this is the most readily justi- 
fied as to comparability of subjects (Ss) and 
method, the Minnesota comparison is given 
first and in detail. The 1927 and 1952 norms 
were compared with respect to the following 
characteristics: (¢) frequency of primary, 
secondary, and tertiary responses, (b) number 
of changes in primary, secondary, and tertiary 
responses, (c) relation of the strength of the 
primary frequencies to the same stimulus 
words in the two norm sets, the response rank- 
frequency distribution and the characteristics 
of the changed response words. 


Frequency of Primary, Secondary, and Tertiary 

Responses 

Table 1 gives the average frequency of the 
primary, secondary, and tertiary responses in 
each of the norms as well as the total frequency 
of the three ranks combined. It is clear that a 
marked increase in the frequency of the most 
popular response has occurred. The primary 
response has on the average increased in strength 
by almost 30% of its earlier value; while the 
most popular response was given by 289 stu- 
dents per thousand in 1927, in 1952 it was 
given by 375 per thousand. This increase is 
also manifest at the second ranking response, 
but the difference has almost disappeared by 
the third response. Overall, the first three re- 
sponses in 1952 accounted for the responses of 
591 students per thousand, while in 1927 they 
accounted for only 489 responses per thousand. 
The change toward more popular responses 
has obviously been very pronounced. 


Changes in Popular Response Words 


The two sets of norms were compared as to 
the identity of particular response words. 
Words ranking first, second, and third in re- 
sponse to a given stimulus in 1927 were com- 
pared to the words ranking first, second, and 
third in 1952. The results are shown in Table 2. 
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TABLE 1 
AVERAGE FREQUENCY OF THE THREE Most Popular 
PONSES ON THE 1927 anp 1952 
Minnesota Norms 








Frequency 








1927 1952 
(N = 925) (N = 1008) 





Ist 267.5 
2nd 111.9 
3rd 72.8 


Total Ist, 2nd, 452.2 


3rd 


TABLE 2 


COMPARISON OF THE RESPONSE Worps OccURRING 
IN THE First THREE RANKS IN THE 1927 AND 











This table shows, as might have been ex- 
pected, that the words which were primary re- 
sponses in 1927 tend to be the primary re- 
sponses in 1952; 71 of the 100 primary 
responses were identical. Fourteen of the 1927 
primaries slipped to the second rank in 1952, 
eight to the third rank, and seven to ranks be- 
low that. The 1927 secondary responses were 
somewhat unstable; 17 meved up to become 
1952 primaries, 14 fell to become tertiaries, and 
33 fell to lower ranks; only 36 remained the 
same. The 1927 tertiaries have an even greater 
dispersion, as the table shows. Overall, 91 
new words entered the top three ranks, and, of 
course, 91 older words moved out. It is clear 
that most of this turnover took place at the 
third rank, with the ranks becoming increas- 
ingly more stable as the strength of response 
increased. 


Strength of Primary Responses to Particular 
Stimuli 


In the face of the facts that some primary 
responses had changed and that primaries in 
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general had greatly increased in strength, it 
was of interest to enquire whether the relative 
strength of a primary response to a given stim- 
ulus in 1927 was related to the strength of the 
primary response to the same stimulus in 1952. 
Accordingly, three correlations were calcu- 
lated: (a) the correlation between frequency of 
the primary responses for ail stimulus words 
on the two occasions, (6) the same correlation 
for the 71 stimulus-response pairs which had 
not changed, and (c) the correlation for the 29 
pairs for which the response had changed. The 
over-all correlation (V = 100 pairs) was + .70; 
the correlation for same-response pairs was 
-+-.65, and the correlation for pairs in which the 
response had changed was +.48. The three 
correlations are all significantly different from 
zero. It is clear that the relative ranks of the 
strength of primary responses to particular 
stimuli remain fairly stable over time. In- 
spection of the scatterplots shows that the 
primary responses with high strength in the 
1927 norms did not tend to change but simply 
increased in strength as popular responding 
increased. The primary responses which 
changed tended to be those which were at 
relatively low strength in 1927. Such responses 
were displaced by close competitors in the 1927 
norms which moved up to become new low 
strength primaries. 


The Total Response Distribution 

The overall distribution of frequencies of 
responses when plotted against response rank 
closely approached a Zipf-type distribution. 
When for the 1952 norms the average fre- 
quency of each response rank was plotted 
against the rank on double logarithm paper, 
the result was almost a straight line from the 
first to the 90th rank. After the 90th rank (at 
which point the average frequency was slightly 
less than one, indicating that at least some of 
the response hierarchies had already been ex- 
hausted) the curve fell away rapidly toward 
zero. This distribution may be compared to 
the rank-frequency distribution for the Kent- 
Rosanoff 1910 norms reported by Skinner 
(1937) and the rank-frequency distribution 
for 200 “other” stimulus words (which were 
not the K-R stimuli) collected by Schellenberg 
(1930) and plotted by Cook and Skinner 
(1939). The present distribution is appreciably 
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more linear than either of the earlier distribu- 
tions and is somewhat steeper, having a slope 
of approximately —1.4 as contrasted with 
roughly —1.3 for the Kent-Rosanoff distribu- 
tion and —1.1 for the Schellenberg “other” 
words. 


The Nature of the Changed S-R Pairs 


The stimulus words which evoked differing 
primary responses in 1952 and 1927 are listed 
in Table 3 with tiieir old and new responses. 
The strength of each response is indicated by 
the percentage of Ss making that response. 

The writers are well aware of the difficulties 
of classifying associative pairs and are €éspe- 
cially wary of “logical” as opposed to “be- 
havioral”’ classifications. Nevertheless it is 
clear that at least nine of the 29 stimulus-re- 
sponse pairs in this group that were popular 
in 1927 were superordinate pairs (i.e., Butter- 


TABLE 3 
STIMULI AND THEIR PRIMARY RESPONSES FOR Pars 
Wauicw WERE DIFFERENT IN THE 
1927 anv 1952 Norms 


Percen- | 
tage of 
Sub- 


| Percen- 


Stimulus 


1927 Response | jects | 1952 Response s 
Making Mak 
the Re- 
| sponse 


Health 
Lamb 
Chair 
Tall 
Moth 
Stop 


Sickness Illness 19 
Mutton Sheep 34 
Comfort Ease is 
Short Long 33 
Butterfly | Insect 20 
Whistle Noise 17 
Wish Desire 20 Want 
Beautiful | Pretty 19 Ugly 
Window Glass | 26 Door 
Citizen Man | 21 U.S. 
Red Color ae White 
Sleep Rest | 25 Bed 
Working | Labor . ¥8 Hard 
Earth | Ground | 17 Round 
Trouble Sorrow Bad 
Cabbage | Vegetable | Head 
Yellow Color | Blue 
Justice Judge 
Bible Book | 
Sheep | Animal 
Bath Water 
Blue Color 
Stove Heat 
Doctor Sick 
Thief Robber 
Lion Animal 
Happiness 
cChid | 


Joy 
Baby 
Food 











| 
Peace 
God 
Wool | 
Clean 
Sky 
Hot 
Nurse 
Steal 
Tiger 
Happy 
Boy 
Crackers | 





Cheese 
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fly-Insect, Red-Color, Cabbage-Vegetable, 
Yellow-Color, Bible-Book, Sheep-Animal, 
Blue-Color, Lion-Animal, and Cheese-Food). 
If one permits a little ambiguity, the number of 
pairs classified as superordinate increases 
(i.e., one may add Whistle-Noise, Wish-Desire, 
Citizen-Man, etc.) The 1952 responses stand 
in sharp contrast to this. None of the new re- 
sponses is clearly of a superordinate nature, 
and only two could be considered to be even 
marginal superordinates (Sickness-Health? 
Butterfly-Moth?). It appears from these data 
that superordinate or classifying responses 
have decreased in popularity since 1927. 

The second major characteristic of the 1927 
responses seems to be synonymity, but it is 
difficult to obtain reliable judgments of this 
relation which are separate from and inde- 
pendent of superordination. The reader may 
judge for himself how nearly synonymous the 
stimuli and responses are. 

The prevailing characteristics of the 1952 
responses seem to be, on the one hand, “co- 
ordination” and, on the other hand, “comple- 
tion’ or some form of sequential relation. Fre- 
quently, these “types” are fused in the new 
stimulus-response relation. As an example, 
Cheese—Crackers may be thought of as a pair 
of coordinates within a class of edibles or as a 
highly frequent sequential combination in 
everyday language, “cheese and crackers.” 
The same is true of Lion-Tiger, Doctor—Nurse, 
Justice—Peace, Red-White, Sickness—Health, 
etc. Again, systems of classification are in- 
adequate to express the difference between the 
norms quantitatively. 

Overall, the changes in primary responses 
seem to suggest that current responses are 
more specific, more concrete, and perhaps more 
determined by high frequency verbal se- 
quences than the older responses. The older re- 
sponses which have been discarded tended 
somewhat more toward abstraction, with the 
most popular responses being those of super- 
ordination and synonymity. 


Discussion 

At this point the import of the contrast of 
the 1927 and 1952 norms needs to be stressed. 
We had suspected from experimental evidence 
that the word association norms had changed, 
but we had not been aware of the nature of 
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the change. Before the data were gathered, 
we had assumed that the 1952 sample would 
show less agreement and greater diversity in 
responses than the 1927 sample. The 1952 sam- 
ple was more heterogeneous (a higher pro- 
portion of youth were in school; veterans on 
the GI Bill represented persons with diverse 
service backgrounds and in many cases in- 
dividuals who would not otherwise have been 
in school, etc.); and the sample was one year 
advanced in education over the 1927 group 
(which is supposed to make for more individual 
responding). In short, we anticipated exactly 
the opposite findings from those we discovered. 
Instead of being more heterogeneous, in word 
association the group was startlingly more 
homogeneous. There had been marked change, 
but it was almost all toward conformity. 

Clearly there was some stability in free 
association data across the 25-year period. 
The high frequency responses tended to persist 
as the most popular responses, and the ranks of 
the most popular responses tended to remain 
the same. On the other hand, we had found a 
greatly increased tendency on the part of the 
1952 Ss to use the popular responses and a 
marked tendency to change low frequency 
primaries of the superordinate sort to other 
primary responses which tended to be more 
specific coordinates or completions. 

The literature offered little aid in explaining 
our data. Kent and Rosanoff (1910) had 
pointed out that their highly educated Ss 
made less use of the popular response words 
than the less well educated Ss in the norm 
group. Cook and Skinner (1939) had used this 
finding to explain the distribution of responses 
on Schellenberg’s (1930) 200 “other” stimulus 
words, where Ss had in fact used the popular 
responses less often and the infrequent re- 
sponses somewhat more often than Kent and 
Rosanoff’s Ss had for comparable ranks on 
different stimulus words. But Cook and 
Skinner were apparently not aware of ihe fact 
that on the 100 K-R stimulus words where 
data were in fact comparable, Schellenberg’s 
sample used the most popular responses more 
often than Kent and Rosanoff’s “mixed adults.” 

We may at this point oppose Kent and 
Rosanoff’s finding and argue that increased 
education increases the use of popular re- 
sponses, thus accounting for the differences 
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between the 1927 and 1910 norms and at- 
tributing the difference between the new and 
the old Minnesota norms to the extra year of 
education, or we may argue that 1952 sopho- 
mores were actually less well educated than 
1927 freshmen (and ignore the contrast be- 
tween the 1927 and 1910 norms) or, finally, 
we may attribute the difference to something 
other than educational level such as the meth- 
ods employed in the various studies or the 
time-lapse and cultural change between 
samplings. The writers feel that the last al- 
ternative is the only one worthy of serious 
consideration. 

If one were going to attribute the difference 
to the method employed, one would pre- 
sumably argue that the group-written form 
produces more common associates than the 
individual-oral form, thus accounting for the 
difference between the 1927 and 1910 norms, 
and then feel free to introduce some other sort 
of variable to explain the difference between 
the 1927 and 1952 Minnesota groups. 

Woodworth and Schlosberg (1954), how- 
ever, feel that the obvious time-pressure of the 
oral-individual form of the test leads to even 
greater use of the popular associates than the 
group-written method. In their chapter on 
“Association” they compare the results of 
individual testing of Brown University stu- 
dents with the 1929 Minnesota norms and 
suggest that the great increase in the frequency 
of popular responses is a consequence of the 
difference in method. Inspection of the limited 
data they present, however, shows a striking 
correspondence to the 1952 norms, which sug- 
gests that the method itself may make only a 
secondary contribution to the high frequencies 
observed. 

An unpublished study by Clousing (1927) 
seems to confirm the negligible effect of method 
(as long as both administrations suggest time 
pressure). In her study, no differences were 
found between the group-written and the in- 
dividual-oral administrations in the frequency 
of primary, secondary, or tertiary responses. 
Keene (1951) asserts that there is little differ- 
ence between the two modes of testing, al- 
though he does not present his group data for 
comparison with his tables of individual data. 
From the data currently available, explana- 
tion of the differences between norms in terms 
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of the differences between individual and group 
testing does not seem satisfactory. 

At this point it was decided provisionally to 
accept the findings concerning the Minnesota 
norms as indicating general trends in associa- 
tive behavior, independent of specific sample 
or methodological differences. We proposed to 
test the findings by examining them in the 
larger context of the 1910, 1925, and 1933 
norms. We felt that three hypotheses were 
testable. 

1. Responses of Ss to the K-R stimuli are 
increasingly concentrated among the popular 
responses for all United States adult samples. 
We proposed to examine the five norms to see 
if there was evidence for a steady increase in 
the frequency of the common responses. 

2. Words used as responses to the K-R 
stimuli tend to change slowly over time, with 
the highest ranking responses having the high- 
est stability. The amount of change for any 
particular rank should be a function of the 
time elapsing between the collections of the 
norm data. 

3. “Abstract” responses to stimuli tend to 
decrease in popularity across the time period 
encompassed by this study. Specifically, if 
superordinates are taken as representative of 
abstractions, they will be found to occur less 
and less often as responses to stimuli, as the 
date of the norms becomes more nearly current. 

It should be noted that these hypotheses 
vary in the amount of support. they receive 
directly from the Minnesota comparison already 
made. The first hypothesis has the most direct 
support, since the recent Minnesota norms had 
already been found to be more concentrated 
in the popular associates than the earlier 
norms, which in turn were known to be more 
concentrated than the Kent-Rosanoff. Only 
if the O’Connor and Keene norms deviated 
from the trend could the hypothesis be denied. 
The second hypothesis, however, has many 
opportunities for confirmation or denial. Each 
set of norm data may be used as a standard, 
and the amount of difference between the 
standard set and other sets may be directly 
calculated. The third hypothesis similarly 
has several opportunities for real tests and may 
be relatively independent of the original data 
which suggested it (the data on changes in 
primary responses alone). Here it was decided 
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to obtain a relatively objective classification 
scheme, classify all superordinates for the 
stimulus words, and examine their frequency 
of occurrence in the several norms. The pre- 
diction, of course, was that a decrease in their 
use would be found over time. 


RESULTS OF GENERAL NorRM COMPARISON 


Frequency of Popular Responses 

Table 4 lists the frequences of the most pop- 
ular responses (first, second, and third rank) 
for all five norm groups. It is immediately clear 
that the simple hypothesis of an increase in re- 
sponse frequencies over time will not by itself 
account for the data. The two norms which are 
closest together in time, 1925 and 1927, are 
widely deviate with respect to response fre- 
quencies. The 1925 norm is very similar in 
frequency to the 1933 norm and more like the 
1952 norm than it is like the 1910 norm. The 
1927 norm is most like the 1910 norm. The 
similarities and differences here are remarkable 
in that they cut across the types of samples 
and methods of administering the tests. 

It is tempting to suggest that two factors 
play a part in the variation found in our data. 
One factor would be the method of test ad- 
ministration (accepting the Woodworth and 
Schlosberg [1954] suggestion that oral adminis- 
tration leads to higher popular response fre- 
quencies) ; the other factor would be the general 
cultural shift which progressively increases the 
use of common responses. At any one time, one 
would expect the oral method to yield higher 
popular response frequencies than the group- 
written method, but over time both methods 
would show the increasing drift toward pop- 


TABLE 4 
AVERAGE FREQUENCY OF THE THREE Most Porutar 
Responses To 100 Stimvunt mn THE Five 
NORMATIVE STUDIES 








1910* 


Response Rank 1925" 192775 15335 1952 





260 86338 


289 357 375 
121 125 136 


First 
Second 122 134 
Third 77 80 79 78 80 


459 552 489 560 591 


* Singular and plural forms of were ¢ in 
computing frequencies to make these comparable to the other 
norms. 
> Ali frequencies expressed as frequencies per 1000. 


Total 1, 2,3 


| 
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ular responses. The data for primary response 
frequencies show parallel lines of increase when 
plotted for oral-individual and group-written 
administrations separately. 

In view of the fact that there is no experi- 
mental support for the first factor and that so 
little is known concerning the actual collection 
of the 1925 norms, this explanation must be 
left as a conjecture and as a suggestion and 
stimulus for further research. 

An empirical finding which is quite clear, 
however, is that at least college populations 
currently show a very high usage of common 
responses. The data for Brown University 
students has already been mentioned as re- 
flecting the high frequencies found in the 1952 
and 1933 norms. In addition, support for this 
assertion is found in the normative compila- 
tion of Meals, Herrick, and Merow (1948) at 
the University of Pennsylvania for 50 of the 
K-R words. These investigators apparently 
selected stimuli which elicited high response 
frequencies as indicated by the 1910 or 1927 
norms. Frequency data for the same fifty 
stimulus words were abstracted from the 1952 
norms for comparison with the Pennsylvania 
data. The proportions responding at each rank 
are paired for the Pennsylvania and Minne- 
sota norms respectively: first rank, .52, .47; 
second, .11, .13; and third, .07, .07. It is easily 
seen that the same intense concentration on 
popular responses is found for Pennsylvania 
college students. 

Overall, with respect to the first hypothesis, 
it may be said that the use of popular responses 
in the free association test is currently at a very 
high level in college populations, whether they 
are tested with the individual-oral or group- 
written method The data strongly suggest 
that the frequency of popular responses is in- 
creasing over time, but that other factors in 
addition to the general trend are important in 
determining popular response frequencies. 


Identity of Words Used as Responses 


Table 5 lists the percentage of identical re- 
sponses at each rank level for each pairing of 
the five normative groups. For the primary re- 
sponses it is clear that the hypothesis that there 
is a steady change with time is acceptable. 
Each norm group shows most overlap with the 
norm group which is closest chronologically. 
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The primary responses in the 1952 norms are 
identical with 73 of the primary responses. in 
the 1933 norms, 71 of the primaries in the 1927 
norms, 65 of the primaries in the 1925 norms, 
and only 58 of the primaries in the 1910 norms. 
Conversely, the 1910 norms show identical 
primaries with 81 responses in the 1925 norms, 
79 responses in the 1927 norms, 74 responses 
in the 1933 norms, and 58 responses in the 
1952 norms. Over the entire table in similar 
fashion all the values are in accord with the 
general hypothesis. 

For the secondary and tertiary responses, 
similar results are found with good con- 
formance to the hypothesis except that the 
1925 and 1910 norms are most alike in both 
cases and the 1933 norms are slightly less simi- 
lar to the 1927 norms than would have been 
expected. From either extreme, 1910 or 1952, 
the progression is quite orderly, however. 

While the 1948 Pennsylvania norms re- 
ferred to above are probably somewhat biased 
by the selection of stimuli known to give high 
frequency response, it is interesting to note 


TABLE 5 


PROPORTION OF IDENTICAL RESPONSES IN THE FIVE 
NORMATIVE STUDIES 








Primary Responses 


1933 1952 


74 58 
79 65 
80 71 

- 73 
73 — 





Secondary Responses 


Norm $$$ . ——— 
1910 1925 1927 1933 

1910 - 53 44 

1925 53 _— 48 

1927 44 48 — 

1933 26 47 43 

1952 26 34 36 


Tertiary Responses 
Nonna , axmnenibi venanininpuaiimaniny enrages 
1925 1927 1933 
1910 36 0 19 
1925 - 29 
1927 27 
1933 27 — 
1952 2 29 
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TABLE 6 
NuMBER OF WorDs DIFFERENT IN THE 
First THREE RESPONSES 
(300 words) 





in passing that they show very high agreement 
with the 1952 norms. Eighty-two per cent of 
the primary responses are identical, 42 % of the 
secondary responses are identical, and 32% of 
the tertiary responses are identical. Inspection 
of Table 5 reveals that these approximate 
very closely the percentages of overlap of the 
1925 and 1927 norms (83, 48, and 32%) which 
are separated by a similar short time period. 

Table 6 presents the other side of the con- 
sistency picture, showing how many different 
words are found in the first 300 response words 
for each of the norm pairs. These data con- 
form closely to the hypothesis except in the 
degree of difference between the 1933 and 
1927 norms. In all other respects the table is 
in accord with the hypothesis. 

Considering all the findings, it seems clear 
that a major determinant of response-overlap 
or response-identity in word association norms 
is the time interval between collections of data. 
Where deviations from this simple prediction 
pattern do appear, there is a suggestion that 
the norms derived by the oral-individual 
method are slightly more alike than would be 
expected from time lapse prediction. That 
other variables are probably active is of course 
conceded, but it is stressed here that time 
lapse alone accounts for most of the systematic 
variation. 


Use of Superordinates 


In order to obtain an unbiased and relatively 
objective standard for deciding which re- 
sponses were superordinates, a written test 
was devised. The test consisted of a set of 100 
sentences of the form “ is a member of the 
class of ——.” Each sentence began with one 
of the Kent-Rosanoff stimulus words. This 
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TABLE 7 


Worps DEFINED AS SUPERORDINATES BY THE 
CRITERION OF THE SUPERORDINATE TEST 








— 


Stimulus | 


| 
> | Fre- 
Super- | 
qodinate on 
Response | — 29) | 


Stimulus 
| 
Red Color | Swift 
Yellow Color | Spider 
Bread Food | Bed 
White (| Color | Ocean 
Green | Slow 
Butter Scissors 
Blue | Butterfly 
Black | Heavy 
Lion | Bible 
Sour Cabbage 
Sheep | Blossom | Flower 
Cheese Sickness | Health 
Hammer Music Art 
Eagle Quiet | Sound 
Chair Furniture | Lamp Furniture 
Table Furniture | River Water 
Bitter Taste | Salt Food 
Sweet Taste | Stomach | Organ 
Fruit Food Anger Emotion 
Mutton Meat 


| Furniture 

| Water 
Speed 
Tool 
Insect 
Weight 

| Book 

| Vegetable | 


Color 
Food 
Color 
Color 
Animal! 
| Taste 
| Animal 
| Food 
| Tool 
Bird 


test was given to 29 students in an introductory 
psychology laboratory course. A superordinate 
response was defined as any sentence comple- 


tion that was given by 15 or more of the stu- 
dents taking the test. We realize that this 
procedure is arbitrary in the extreme, but it 
includes the stimulus-response pairs which 
most judges would agree are “really” superordi- 
nates, provides an unequivocal criterion for 
counting responses, is unbiased with respect 
to the normative data to be judged, and ex- 
cludes words for which superordinates are, to 
say the least, difficult to define. This procedure 
yielded agreement as to superordinate for 39 
of the 100 K-R stimuli. These stimuli and their 
superordinates are listed in Table 7 with the 
frequency with which the superordinate was 
given. 

The occurrences of the superordinate re- 
sponses to appropriate stimuli were counted in 
the 1910, 1925, and 1952 norms where com- 
plete data were available. The 1927 and 1933 
norms could not be used in this comparison 
because only the three most popular responses 
are available for each stimulus. Accordingly, 
a separate tabulation was made of the 26 su- 
perordinate responses which were present in 
the 1927 norms and the 18 superordinate re- 
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sponses which were present in the 1933 norms. 
The data are presented in Table 8. 

Inspection of the table verifies the hypothe- 
sis concerning the decline of superordinate re- 
sponses. The frequency of occurrence of the 39 
superordinate responses identified has de- 
clined in the 1952 norms to just slightly more 
than half the frequency manifested in the 1910 
norms. The frequency of occurrence of the 26 
superordinates present in the 1927 norms (ist, 
2nd, and 3rd ranks), even though biased up- 
ward for that norm set because of selection, 
shows a marked decrease from 1910 to 1952. 
The frequency of occurrence of the 18 super- 
ordinates present in the 1933 norms does not 
even appreciably reflect the selective bias but 
falls steadily from 1910 to 1952. 

A detailed response-by-response Comparison 
confirms this trend as a general case not at- 
tributable to a marked change in a few re- 
sponses. Comparison of the 1910 and 1952 
norms shows decreasing frequencies for 33 of 
the 39 superordinate responses. Further, for 
what might be called the clearest cases of 
superordination (those identified on the super- 
ordinate test with a frequency of 20 or greater), 
24 out of 25 responses have declined. Com- 
parison of the 1925 and 1952 norms shows a 
decline in 31 of the 39 superordinate responses 
and 23 of the 25 “strong” superordinate re- 
sponses. The comparison of the 1910 and 1925 
responses, while not as overwhelming, is still 
conclusive; 30 of the 39 superordinates de- 
clined in frequency and 21 of the 25 “strong” 
superordinates declined in frequency. The 


TABLE 8 
Use oF SUPERORDINATES AS RESPONSES 








1910 39: 1925 1927 1933 1952 





39 Superordinates 
Average percentage 17.5 15.4 


26 Superordinates* 


Average percentage 24.7 22.3 22.6 


18 Superordinates* 


Average percentage 3.1 28.3 27.9 22.4 19.2 





* For the 1927 and 1933 norms only the first three responses are 
available. Twenty-six superordinates occurred in the first three 
ranks in 1927 and 18 superordinates occurred in the first three ranks 
in 1933. All those appearing in 1933 also appeared in 1927. The 
second and third rows of the table are biased upward for 1927 and 
1933, therefore. 
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decline of popularity of the superordinate re- 
sponse appears to be strikingly demonstrated. 


Discussion 


The first hypothesis concerning the general 
increase in the frequency of popular responses 
appears only partially confirmed. It is clear 
from the data that popular responses are cur- 
rently at very high frequencies in college pop- 
ulations, ut the five normative summaries do 
not support the notion of a simple trend in this 
direction over time on all United States adult 
samples. The 1925 sample shows a much 
greater use of popular responses than the 1927 
sample. The most parsimonious explanation 
at this time seems to be a two-factor explana- 
tion recognizing both the drift toward popular 
responses over the time-span included in this 
study and a tendency for the oral-individual 
test technique to generate more popular re- 
sponses than the group-written technique. 

The second hypothesis concerning the 
change in identity of popular response words 
over time seems to be adequately confirmed. 
Norms collected closest in time show the maxi- 
mum amount of overlapping in responses (re- 
gardless of sample differences), and high fre- 
quency responses prove to be much more stable 
than lower frequency responses. There is a 
further suggestion that test technique con- 
tributes to similarity of responses. 

The third hypothesis concerning the decline 
in the use of superordinates as word association 
responses is likewise adequately confirmed. 
A clear and highly significant trend was dis- 
covered and the evidence further indicates 
that the more clearly the responses were identi- 
fied as superordinates, the more marked the 
trend was seen to be. 

Interpretations of these findings are at best 
conjectural. The writers have been tempted to 
discuss the study in terms of the growth of 
“mass culture,” “other-directedness” and simi- 
lar constructs much used today in characteriz- 
ing our culture. The enormous growth of the 
mass media, the standardization of school 
courses, procedures and text books, and the 
presumably greater homogeneity of our verbal 
culture are appealing variables to invoke to 
explain the phenomena manifest in our data, 
but no evidence can now be marshalled to sup- 
port such a discussion. 
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A much simpler explanation for much of the 
data suggests itself; namely, that test-taking 
attitudes have been changing over the time 
period discussed here. The work of Dunn, 
Bliss, and Siipola (1958), Siipola, Walker, and 
Kolb (1955), and to a lesser extent that of 
Flavell, Draguns, Feinberg, and Budin (1958) 
clearly establishes the importance of the time- 
pressure set in producing popular and “‘super- 
ficial” responses, such as contrasts and co- 
ordinates, and decreasing responses which have 
a synonymous or superordinate relation to the 
stimulus. These modern studies of test-taking 
attitude mirror the effects achieved by Wells 
in a much older study of practice effects in 
free association. Wells (1911) gave his Ss 20 
association lists of 50 words (one list per day) 
and then retested them on the first two lists. 
He observed a marked decrease in reaction 
time and a change to “superficial” associations. 
He reports the qualitative changes in response 
for each of his six Ss, and for each one it is 
remarked that the frequency of superordinate 
responses decreased (the decreases vary from 
one-half to two-thirds of the original number of 
responses so Classified). 

It would appear that practice effects (“test- 
wiseness”) and time pressure (“‘test-attitude’’) 
bring about much the same effects: so-called 
superficiality of response, decrease in super- 
ordinates, and increased popular responding. 
If we assume that the oral-individual form 
maximizes time pressure and that more recent 
dates give us more and more test-sophisticated 
Ss we would seem to have all the necessary 
variables required to account both for the 
frequency data and the superordinate data. 

It is true of course that “test-wiseness,” or 
“test-docility” of Ss is after all a general 
product of our culture and that this explana- 
tion may still be related to general questions 
of cultural change on a grand scale. However, 
this explanation has the virtue of greater 
specificity and seems to furnish a point of 
attack for further investigations which the 
more sweeping explanation fails to furnish. 

It should be noted beyond this, however, 
that regardless of the explanation given for the 
increasing frequencies of popular responses 
and decreasing frequencies of abstract re- 
sponses, the systematic change in the over- 
lapping of specific stimulus-response pairs as a 
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function of time-lapse between normative 
studies is in the main separable from these 
phenomena. This systematic change appears 
to the writers to be attributable not only to 
the drift away from abstract responses but 
also to changes in the “meaning” of words 
(e.g., “lamp” in 1910 apparently meant an 
“oil lamp” in contrast to its current meaning) 
and to particular recent experiences concern- 
ing the words which were not duplicated during 
the other norm-collection periods (e.g., in 
1952, a strenuous presidential campaign was 
in progress and “whistle-stop” became a very 
popular associative pair). These changes 
could profitably be studied as part of a general 
study of the change in meaning of common 
words over time. 


SUMMARY 


A comparison was made of word association 
norms collected in 1927 and in 1952 for Uni- 
versity of Minnesota students. It was found 
that primary responses had greatly increased 
in frequency and that the first three responses 
to the stimulus words accounted for 59% of 
all responses in 1952 as contrasted with 49% in 


1927. Seventy-one per cent of the primary 
responses were identical. Primary responses 
were found to be more stable than secondary 
or tertiary responses. The frequency of the 
primary response to individval stimuli corre- 
lated +.70 between the two norms. Primary 
responses which had changed, tended to be 
low-strength superordinate responses which 
were replaced by coordinates and completions. 
Three hypotheses arising from the 1927-1952 
comparison were examined with respect to five 
major collections of free association data: the 
1910 Kent-Rosanoff norms, the 1925 O’Connor 
norms, the 1933 Keene norms, and the 1929 
and 1952 Minnesota norms. The first hypothe- 
sis, that there is a general tendency for the 
frequency of popular responses to increase 
with time, received partial confirmation. It 
was concluded, however, that other factors, 
such as testing method probably exercised 
some effect in addition to the time change. 
The second hypothesis was that the words 
used as responses to stimuli tend to change 
slowly but systematically over time with the 
highest ranking responses having the highest 
stability. This hypothesis was confirmed. 
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The third hypothesis was that abstract re- 
sponses (specifically superordinates) have 
tended to decrease in popularity across the 
time period of this study. Superordinates were 
identified for the Kent-Rosanoff stimuli by 
means of a sentence completion technique. 
The hypothesis was confirmed. 

Possible explanations for these phenomena 
were mentioned and it was suggested that a 
general change in test-taking attitudes might 
account both for the increase in popular re- 
sponses and the decrease in abstract responses. 
It was further suggested that the change in 
individual response terms (aside from the fre- 
quency and abstraction changes) might be 
attributable to changes in the meaning of 
particular stimuli over time. 
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NYSTAGMUS: 
AN INTEGRATIVE NEURAL DEFICIT IN SCHIZOPHRENIA! 


WALTER W. LEACH 
Mississippi Southern College 


HE significance of body image in psycho- 

pathology calls attention to propriocep- 

tor mechanisms. One of the most impor- 
tant is the ves*ibular apparatus because of its 
reflexive adjustment of the body’s position. 
Through integration of impulses from the semi- 
circular canals at cerebellar and oculomotor cen- 
ters, an individual maintains equilibirum and 
orientation to his environment. Mowrer (1932) 
and Holsopple (1929) demonstrated that semi- 
circular canal eye reflexes maintain normal 
visual perception during head movements. 
The observation that some schizophrenic 


patients have a marked disturbance in body 
image and orientation to their environment 
suggests disturbances in their equilibratory 
processes. 

The semicircular canals regulate muscle 
tonus throughout the body. Angyal (1935, 
1936, 1937) describes a schizophrenic syndrome 


in which a bizarre experience of the body oc- 
curs, ¢.g., an impression of heaviness or light- 
ness. Angyl and Blackman (1940) used the 
Barany test to compare the postrotation 
nystagmus of 58 schizophrenic and 20 normal 
subjects (Ss). They lacked a recording device 
and could set no criteria for the eye move- 
ments that constitute a nystagmic beat. In 
addition, they made no observations during 
stimulation. Objections can be raised concern- 
ing the nature of the stimulation in the Barany 
test (10 rev. in 20 sec. with a sudden stop). 
This extremely intense and erratic stimulus is 
greatly beyond appropriate physiological 
limits and frequently produces distressful 
side-effects. 

The primary purpose of this paper is to com- 
pare experimentally groups of schizophrenic 
and normal Ss by measuring characteristics of 
their nystagmus. Amplification of the finding 


! This paper is based on a dissertation presented to 
the faculty of Princeton University in partial fulfill- 
ment of the requirements for the degree of doctor of 
philosophy. The writer wishes to express his apprecia- 
tion for assistance to E. G. Wever and the late J. Q. 
Holsopple. 
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of Angyal and Blackman (1941) required 
apparatus to provide controlled uniform 
angular acceleration and to record nystagmus 
during as well as after rotation. A neural corre- 
late of schizophrenia could provide an objective 
basis for diagnostic classification. In its broader 
implications, this experiment is aimed at a 
unified approach to biological and psychologi- 
cal functioning. 


METHOD? 


Subjects. Seventy-five male inpatients, diagnosed as 
schizophrenic by the staff of the Lyons, New Jersey, 
VA Hospital, made up the experimental group. Their 
ages ranged from 19 to 49 (mean, 29; SD, 8). Pa- 
tients with a record of neurological disease were ex- 
cluded. Average duration of hospitalization was less 
than six months. 

Ninety-six noninstitutionalized males, ranging from 
14 to 76 years of age (mean 26; SD, 10), were in the 
normal control group. Nearly all were university stu- 
dents. The same basis for exclusion applied as for the 
patient group. 

Apparatus. A rotating chair designed by J. Q. 
Holsopple and the writer provided controlled intensi- 
ties of uniform angular acceleration. Since a falling 
weight accelerates at a uniform rate if meeting con- 
stant resistance, an essential requirement was that the 
chair be precisely balanced. During the fall of a weight 
attached to the axis by a cable, the chair accelerated 
until the cable unwound. It then decelerated at a uni- 
form rate as the momentum lifted the weight. The total 
weight of the chair while rotating Ss of varying weights 
was kept constant. Each S was weighed, and lead in- 
gots in a pan attached to the chair were added or sub- 
tracted accordingly. 

The intensity of acceleraticn was determined by the 
amount of weight attached to the cable. Uniformity 
and intensity were empirically calibrated at each value 
of the stimulus. The importance of known uniformity 
is seen if one considers how unsatisfactory an average 
value would be in other sense departments.* 

A standard ophthalmograph (American Optical Com- 
pany, Model 1520) mounted on the foot plate of the 
chair was used to record nystagmus during and after 
stimulation. Start and end of stimulation were signaled 
directy on the ophthalmographic film. The head rest 
aligned the vertical axis of the head with that of the 
chair to stimulate the horizontal semicircular canals. 


2 For a more detailed description se®@ Leach, 1953. 
*For a discussion of the problems involved cf., 
Holsopple, 1929; Mowrer, 1932; and ter Braak, 1936. 
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Procedure. After S was seated in the chair and the 
head rest was adjusted to prevent movement, he was 
instructed to look at an “X” a short distance from 
his eyes. Corneal reflections from each eye were focused 
onto the film. A record of eye steadiness was obtained 
before each rotation. Ss were stimulated in three sepa- 
rate rotations with mild, moderate, and strong accelera- 
tions administered in a sequence of 2, 5, and 12° per 
sec. per sec. Duration of the rotation was 30 sec. at 2°, 
21 sec. at 5°, and 16 sec. at 12°. The first and third rota- 
tions were counter clockwise; the second was clockwise. 

A brief explanation of the purpose was given to each 
S. Cooperation of psychotic Ss was facilitated by elimi- 
nating all cues and any need for communication. The 
lights of the ophthalmograph were shielded so that none 
of the surroundings were visible. 

Measures. The following measures were made from 
the nystagmograms at each stimulus intensity: 

1. Acceleration latency: time in seconds from be- 
ginning of rotation until the completion of the first 
nystagmus. 

2. Deceleration latency: time in seconds from be- 
ginning of rotation until the completion of the first 
reversed nystagmus. 

3. Number of acceleration nystagmi during rota- 
tion. 

4. Number of deceleration nystagmi during rota- 
tion. 

5. Number of postrotation nystagmi. 

6. Duration of postrotation nystagmi: time in sec. 
until (¢) no nystagmus appeared for three sec. or 
more, or (5) until response rate dropped by one-half 
or more. Rotation and postrotation rates were ob- 
tained by dividing the number of nystagmi for each 
period by the duration of each in sec. 

The nystagmograms were read by passing the 35 
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mm. film over a translucent glass in front of a 25-watt 
light bulb. Since the film was exposed at the rate of 4 
inch per sec., time was measured in units of .12 sec. 
along the edge of a ruler calibrated in sixteenths of an 
inch. A nystagmic beat was counted for each sequence 
of slow deviation of the eye tracings from the fixation 
point and a compensatory saccadic return. The direc- 
tion of the eye movements was to either the right or 
left, depending upon (a) whether the stimulus was ac- 
celeration or deceleration, and (6) the direction of rota- 
tion. In a clockwise rotation, the slow phase during 
acceleration was to the left and the compensatory phase 
to the right. This sequence reversed to right-left during 
deceleration. In counter-clockwise rotation, these direc- 


tions were exchanged. 
RESULTS 


The schizophrenic group showed diminished 
nystagmic reactivity for all of the measures 
except deceleration latency. Their deficits 
during stimulation appeared in their longer 
reaction times and slower rates of response. 
Postrotation nystagmus showed a similar 
reduction. The percentage deficit and the 
statistical significance cf the differences be- 
tween the schizophrenic patient and the nor- 
ma! control groups are given in Tables 1 and 2. 

The amount of deficit varied, depending 
upon the intensity of stimulation. It was least 
to strong stimulation and greatest for weak 
stimulation for all measures except number of 
deceleration nystagmi. As the stimulus inten- 


TABLE 1 
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TABLE 2 
CoMPARISON OF PostROTATION Nystacmus AT 3 Stiwutus INTENSITIES IN 96 NoRMAL AND 75 ScHIzOPHRENIC Ss 
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Probability 





(#) (SD) 
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Number 
2° per sec. per sec 
5° per sec. per sec. 
12° per sec. per sec. 
Duration in seconds 
2° per sec. per sec. 
5° per sec. per sec. 
12° per sec. per sec. 
Postrotation rate: nystagmi per second 
2° per sec. per sec. 
5° per sec. per sec. 
12° per sec. per sec. 





sity increased, the amount of deficit decreased 
in the more significant measures, acceleration 
latency and rotation rate. This inverse rela- 
tionship between intensity and deficit is seen 
most clearly in acceleration latency. In the 
normal controls, the latency of acceleration 
nystagmus was nearly constant at the mild, 
moderate, and strong intensities used in the 
experiment.‘ The latencies of the schizo- 
phrenic group, on the other hand, became 
shorter and approached those of the controls. 

Deceleration latency is a special instance of 
reactivity in which the schizophrenic group 
showed no significant deficit. Once responding 
to acceleration stimulation, both groups re- 
versed their reaction to deceleration with equal 
facility. After reversal had taken place, how- 
ever, the schizophrenic group at first (during 
rotation! showed least deficit to the moderate 
stimulus. In keeping with the general pattern, 
postrotation deficits were least to the strong 
stimulus. 

Postrotation nystagmus is a phenomenon 
of receptor adaptation and recovery (Stein- 
hausen, 1933). This after response corresponds 
to afterimages of rotation which are analogous 
to afterimages in other sensory modalities. 
The data in Table 2 amplify in terms of known 
stimulus intensities the finding of Angyal and 

‘ Differential sensitivity of the right and left laby- 
rinths may account for the slightly longer latency found 
for the controls at 5° per sec. per sec. than at 2° and 12° 
per sec. per sec. Rotation was clockwise at 5° per sec. 
per sec., which ipsilaterally stimulates the right semi- 
circular canals. At 2° and 12° per sec. per sec., the left 
semicircular canals were stimulated ipsilaterally. 
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Blackman (1941) of postrotation nystagmic 
deficit in schizophrenia.*® 


DISCUSSION 


The direct relationship of nystagmus to the 
physiology of the semicircular canals has been 
shown by Steinhausen (1933), Dohiman 
(1935-36), and Lowenstein and Sand (1940). 
Their work established angular acceleration 
as the stirnulus. In any head or body move- 
ment, eye fixation is maintained by semi- 
circular canal reflexes. In the experimental 
situation, the fast phase has an adaptive func- 
tion. Following the eye’s slow deviation, the 
fast phase re-establishes the original fixation. 
This occurs when the visual field rotates in a 
constant relationship to the S. 

An idea of the complexity of the process 
involved can be gained from a consideration 
of the events in experimental rotation. Acceler- 
ation occurs during the first part and decelera- 
tion during the last, when the direction of the 
slow and fast phases of nystagmus reverses. 
Deceleration nystagmus perseverates after 
stimulation (postrotation nystagmus). In a 
clockwise rotation, the appropriate sensation 
of rotation to the right is experienced during 
acceleration. In the bref interval when the 
stimulus changes from acceleration to decelera- 
tion, the rotation seems to stop. There follows 
the illusion of rotating in the opposite direction. 
This illusion continues into the postrotation, 
period concomitant with postrotation nystag- 
mus. 

® See also Angyal (1941, 1940, 1942). 
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The results indicate schizophrenic deficits in 
each stage of this experimental sequence except 
deceleration latency. They do not indicate 
where in the neural chain of events (e.g., the 
semicircular canals) impairment, if any, might 
lie. The most conservative inferences suggested 
are those related to the complexity and inte- 
gration of the neural processes. 

1. Of foremost importance is the integrative 
or adaptive nature of the neural deficit. Con- 
sidering the proprioceptor and orienting roles 
the semicircular canals play, the nature of the 
deficit suggests a biological concomitant of 
personality disturbances similar to those as- 
sociated with the body image. 

2. The results also emphasize the dimen- 
sional nature of deficit in schizophrenia. 
Nystagmic reactivity for both groups falls into 
fairly normally distributed measures, and their 
standard deviations reveal considerable over- 


lap. 

3. Adequate neural function was demon- 
strated in at least one aspect of schizophrenic 
adaptation. When the stimulus or stress re- 
versed from acceleration (positive) to decelera- 
tion (negative), no significant deficit in latency 


of deceleration nystagmus seemed to occur. 
Various social and personal aspects of schizo- 
phrenia may be subject to a similar principle. 

4. With increasing intensity of the stimulus, 
schizophrenic deficit decreased. This tendency 
is demonstrated most clearly in latency of 
nystagmus to onset of rotation. Schizophrenic 
deficit was less than one-fourth for moderate 
and strong intensities than for a mild intensity. 
In its more general implications, this relation- 
ship is suggested by stress concepts of Seyle 
(1956). The greater the intensity of the stress, 
the closer organisms may approach the uni- 
formity of the general adaptation syndrome. 

5. Confirmation of findings of Angyal and 
Blackman (1940, 1941) of reduced postrotation 
nystagmus in schizophrenia was experimentally 
established. Their postrotational mean rates 
of 1.32 for the patients and 1.68 for the normal 
controls show a schizophrenic deficit of 21%. 
Even though the deceleration stimulus pro- 
duced by the Barany method is estimated at 
upwardsof 1000° per sec. per sec., post rotation 
deficits are greater at 2° and 5° per sec. per sec. 
(30% and 24% respectively). 

The procedure was especially applicable to 
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mentally ill Ss with communication difficulties. 
Minimal cooperation entirely of a nonverbal 
nature was required of the Ss. The automatic 
nature of the response prevented S’s rapport 
from affecting the measurements. 

Other advantages were provided by the 
experimental conditions. None of the Ss were 
apparently bothered by aftereffects of rotation. 
The functional relationship between stimulus 
intensity and nystagmus could be established 
because both were precisely quantified. Note- 
worthy is the comparison of the Barany 
method to that employed in the experiment 
reported here. Stimulus levels far below the 
shock level of that clinical technique were more 
adequate in identifying the degree of neural 
integration. 


SUMMARY 


This experiment demonstrated some aspects 
of neural integrative adequacies and deficits 
in schizophrenic function. The nystagmic reflex 
is a useful measure because of its adaptive 
nature related to an individual’s thresholds, 
visual-motor orientation, and proprioception. 
Certain conclusions and implications may be 
enumerated. 

1. Schizophrenics as a group have deficits in 
latency and amount of nystagmus varying 
with intensity of stimulation. 

2. Adequacy of schizophrenic integration 
and sensitivity increased as intensity of stimu- 
lation increased. 

3. No deficit occurred when the stimulus re- 
versed from positive to negative (acceleration 
to deceleration). 

4.A mild stimulus was more effective in 
demonstrating deficit than stronger stimuli. 
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SELF-ESTEEM AND NEED ACHIEVEMENT AS DETERMINANTS 
OF SELECTIVE RECALL AND REPETITION: 


STANLEY COOPERSMITH 
Wesleyan University 


ERSONALITY theorists ascribe the selec- 
Pp tivity of memory to the screening of 
experiences in terms of their accept- 
ability to the ego. Stimuli that threaten the 
person’s self-esteem are assumed to be less 
available to consciousness while ones that are 
acceptable or enhancing are permitted freer 
expression. Anxiety, threat, stress, and in- 
volvement are terms that have been used to 
describe the psychological state which ensues 
when there is the possibility of lowered self- 
esteem. Psychological defenses against anxiety 
maintain as positive a self-evaluation as is 
possible, and, as Freud notes, “defenses proceed 
from the self-respect of the ego” (Freud 
1924). Studies of success and failure have 
suggested (Alper, 1957; Eriksen, 1952, 1954) 
that where self-esteem is firmly established, 
failures are more readily recalled and per- 
ceived, while insecure or lowered self-esteem 
results in denial and heightened thresholds for 
negative stimuli. Similar investigations relating 
to achievement motivation (Alper, 1957; Atkin- 
son, 1953) strongly suggest that the desire to 
overcome a failure is a further influence upon 
the direction of recall, and one that is likely 
to be expressed in behaviors designed to 
overcome the affect thereby aroused. The 
purposes of the present experiment were to 
investigate the relationship between several 
indices of self-esteem, achievement motiva- 
tion, and the recall and repetition of success 
and failure experiences; and examine the total 
and positive and negative aspects of n Achieve- 
ment within the context of self-esteem types 
previously found to be reliable and consistent 
(Coopersmith, 1959), and determine the man- 
ner in which these aspects are related to these 
experimental behaviors. 
Task interruption has been widely used in 
experiments concerning selective recall. Zeigar- 


1 The research cited in this paper originated under 
USPHS Grant M (901) and is currently supported by 
USPHS Grant M(2696). An earlier version of this paper 
was presented to the 1959 meeting of the American 
Psychological Association. 


nik (1927) introduced the method with the 
finding that incompleted tasks were recalled 
more frequently, and subsequent studies have 
attempted to determine the variables asso- 
ciated with different recall ratios. Converging 
lines of evidence now indicate that the recall 
pattern is a function of the stress introduced 
into the situation, individual differences in 
ability to tolerate the stress, and motivations 
aroused by its introduction. Studies of group 
trends (Alper, 1946; Glixman, 1949; Rosen- 
zweig & Mason, 1934; Rosenzweig, 1943), 
although inconsistent, tend to indicate that 
as threat to self-esteem increases, the recall of 
incompleted tasks decreases, while the recall 
of completed tasks remains fairly constant. 
A number of early investigators remarked upon 
the personality characteristics of subjects who 
recalled more failures. They were variously 
described as proud (Rosenzweig & Mason, 
1934), ambitious (Zeigarnik, 1927), and need- 
ing success (Cartwright, 1942), and older 
(Sanford, 1946), and more mature (Rosen- 
zweig & Mason, 1934). Recent studies, more 
empirical and dynamic in formulation, have 
investigated personality dimensions broader 
and more basic in nature. Recall of failure 
under stressful conditions has been reported 
for persons high in ego strength (Jourard, 
1954; Tamkin, 1957) and achievement motiva- 
tion (Atkinson, 1953), who show relatively 
little perceptual defense (Eriksen, 1952) and 
are more sure of their personal and intellectual 
worth (Caron & Wallach, 1957; McArthur, 
1954). The interaction of situational and 
personality variables results in four possible 
recall patterns, two of which, (a) a shift from 
failures to successes under stress, and (b) a 
shift from successes to failures under stress, 
have been reported. The two other patterns 
(c) recall of successes under all conditions, 
and (d) recall of failures under all conditions, 
have been suggested (Alper, 1948) but not 
reported. 

In the present experiment subjects differing 
in subjective and behavioral evaluations of 
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self-esteem, and representing different types of 
self-esteem (Coopersmith, 1959), were in- 
dividually presented a series of tasks, half of 
which were failed. A measure of achievement 
motivation was then obtained under what were 
designed to be stressful and arousing condi- 
tions. Selective recall and the subject’s deci- 
sions to repeat either successes or failures 
were then elicited. 

Repetition behavior was obtained on the 
assumption that it is a more concrete con- 
frontation with previous experience than is 
recall, and may thus be more anxiety pro- 
voking. Furthermore, repetition involves a 
commitment of future behavior and thus 
provides an opportunity for testing the hy- 
potheses (Caron & Wallach, 1957; Osgood, 
1952) that recall is a cognitive precursor to 
behaviors that the person has previously found 
to be constructive. Repetition differs from 
resumption (Ovsiankina, 1928) in that it 
controls for differences in point of interruption 
on failed tasks, and provides an opportunity to 
redo all tasks in their entirety without dis- 
tinction between successes and failures. 


METHOD 
Subjects 


The sample consisted of 48 middle middle- and 
upper middle-class children enrolled in the fifth and 
sixth grades of an urban elementary school.* Four 
groups, of 12 Ss each, were chosen from a group of 102 
children on the basis of their responses on the Self- 
Esteem Inventory (SEI), and a teacher’s rating of their 
self-esteem behavior as recorded on the Behavior 
Rating Form (BRF). The four groups were found to 
differ significantly on a constellation of variables 
(achievement, sociometric status, ideal self, and n 
Achievement) and may be briefly characterized as 
follows: High-High (SEI-BRF)—above average per- 
formance commensurate with goals; High-Low—low 
achievers with high goals; Low-Low—low achievers 
who have accepted their inferior performance and 
position; and Low-High—high achievers with extremely 
high goals but with low self-esteem. It thus becomes 
possible to analyze patterns of recall and repetition not 
only in terms of the four types of self-esteem, but also 
to compare the results of groups high (H-H, H-L) and 
low (L-L, L-H) in subjective evaluation, and groups 
rated high (H-H, L-H) and low (H-L, L-L) on their 
self-esteem behaviors. The measures and methods used 
to select these Ss and the elaboration of the self-esteem 





*The author wishes to thank the superintendent, 
principals, faculty, and students of the Ithaca Public 
School System for their cooperation with this study. 
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types have been presented in a previous paper 
(Coopersmith, 1959). 

In deciding which tasks should be employed, the 
following criteria were established: (a) equal attractive- 
ness; (b) capable of accomplishment by every child; 
(c) requiring approximately the same amount of time; 
(d) generally familiar; (¢) interesting; (f) success or 
failure related to the S’s performance; and (g) relative 
unlikelihood that Ss would interpret their accomplish- 
ment in terms of partial success and failure. In general, 
it was found that tasks requiring simple motor coordi- 
nation were most effective. 

In order to control for recall or reptition due to the 
nature or attractiveness of the task, a success and failure 
variation of each task was employed. Only those tasks 
rated approximately equal in attractiveness were used. 
Although this use of alternates of the same task would 
tend to increase the likelihood that the first would act as 
a mnemonic to the second, the tasks were sufficiently 
different and presented in such an order and manner 
that their relatedness was not clearly apparent. The 
six pairs of tasks consisted of: (¢) Mosaics—S was 
asked to replicate a design of a boy and a flowerpot 
using plastic heptagonal blocks; (6) Cut-outs—S was 
shown a paper cut-out of an animal after which the Z 
disassembled it and asked the S to reproduce it; (c) 
Scrambled sentences—S was asked to arrange the 
words presented to him on cards into a meaningful 
sentence; (¢d) Kohs blocks—S was asked to reproduce a 
design of a house and a series of stripes; (¢) Jig-saw 
puzzle—S was shown a completed puzzle after which the 
E disassembled it and asked the S to reassemble it; 
and (f) Snap-block construction—S was asked to 
reproduce a design of a horse and a robot using varied 
shaped blocks that snapped together. 


Procedure 


Task administration. The set of 12 tasks was ad- 
ministered individually to each S under conditions 
designed to threaten self-esteem. Z had previously 
announced to the class that they were to participate 
in a study designed to test the attitudes and achieve- 
ment of children. The selected S was taken to a private 
office in the school building, seated before a desk, at 
the side of which sat EZ, stopwatch in hand. S was 
informed that he would be permitted two minutes to 
complete each task and that EZ would time him and 
record the nature of his performance. The tasks were 
out of S’s sight until Z presented them individually in 
the sequence S FSS F FS F FSS F. They were 
rotated so that success tasks occurred an equal number 
of times in each success position and failure tasks an 
equal number of times in each failure position. Success 
and failure were controlled by timing, with the differ- 
ence between success and failure variations of each 
task generally less than 30 sec. Analysis indicated no 
effect due to either serial position or point of inter- 
ruption. At the conclusion of each task EZ announced 
that S had either succeeded or failed upon the task; 
he repeated this before proceeding with the next task. 

Measurement of n Achievement. After £ had placed 
the last task out of view, he asked S to complete four 
stories that had previously been found (McClelland, 
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Atkinson, Clark, & Lowell, 1952) to elicit n Achieve- 
ment in children. This measure was included to deter- 
mine motivation under arousal conditions. It provided 
a “filler” between performance and recall, and made it 
possible to evaluate the relationship between achieve- 
ment motivation and recall as previously demonstrated 
(Atkinson, 1953). 

The story completions dealt with: a boy (or girl) at 
school; brothers and sisters playing; two men (or 
women) near a machine; and a boy (or girl) seated at a 
desk. As the child presented the stories orally, they were 
recorded by EZ. The average duration of this period was 
approximately 15 min. It had previously been deter- 
mined (LicClelland et al., 1952) that stories were more 
successful than pictures in eliciting achievement motiva- 
tion from children. 

The scoring of the stories followed the established 
criteria (McClelland et al., 1952), each story being 
scored separately for the presence or absence of each 
of the 13 achievement related categories. They were 
further grouped into the positive (I+, Ga+, G+, 
Achievement Thema) and negative aspects (I—, Ga—, 
G—, B) subsumed under the total n Achievement score, 
aspects which are apparently “idicative (McClelland 
et al., 1952) of hope of success and fear of failure 
respectively. A product-moment correlation of .85 was 
obtained between the total n Achievement scores of 
two independent judges. 

Obtaining recall and repetition. After S had completed 
the last of his stories, Z asked him to report which of 
the tasks he remembered most clearly of all upon which 
he had previously performed, then as many others as 
he could recall. Pretests had established that a large 
percentage of the children could recall all the tasks 
and it was therefore decided to include a measure of 
‘clearest” recall, with instructions indicating the 
acceptability of any recall preference. 

E then led the child behind a curtain to where the 
tasks, all completely assembled, were located. He 
indicated a pair of tasks, offered S an opportunity to 
repeat either the success or failure variation, and after 
the decision had been made allowed him to redo the 


TABLE 1 


MEAN NuMBER OF SuccESSES RECALLED BY 
DIFFERENT Types or SEL¥F-EsTEEM 


Fail/ 
Succ. 
Ratio 


First, 


Three” Total 


Initial 


Low-Low 4.7 .84 
Low-High - , 5.1 .89 
High-Low sa ; 4.8 91 
High-High : 5.3 .83 
Pe >.” — 


* The terms refer to upper vs. lower quartile status on the 
Self-Esteem Inventory and the Self-Esteem Behavior Rating 
Form, respectively 

> The individual scores for the first three, and total recalls 
were transformed into percentage success form. The recall per- 
centages were then transformed into angles by the arc sine per- 
centage transformation. 

® P is based on x* distribution for initial recall, and on the F 
distribution for the first three and total recalls. 
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TABLE 2 


FREQUENCY OF RECALL AND REPETITION 
PATTERNS FOR SELF-EsTeEEM TyPEs 





Repetition 


More . More re 
Fail. 24%! Succ. ij, Baual 


More 
Succ. 


High-High 





task. Decisions were obtained for the other pairs of 
tasks, although S did not actually undertake the task 
again in the remaining five cases. Before leaving he 
was asked to keep the entire procedure in confidence. 
Insofar as is known, no breach of confidence occurred. 


RESULTS 
Self-Esteem 


In Table 1 the recall behaviors of the 
different types of self-esteem are compared. 
In order to test for a possible sequential effect, 
initial recall, first three recalls, and total 
recall of successes were noted. The most 
striking feature of these results is the lack of 
any significant differences between the self- 
esteem groups. The Low-High and High-Low 
groups showed a tendency to recall more 
failure on the first recall but this difference is 
well within chance expectancy. On the first 
three recalls all groups recalled at least twice 
as many successes as failures, while the ratio 
of failure to success of less than one for total 
recall corroborates the general tendency for 
the recall of success tasks. The possibility that 
Ss in each group would differ in the frequency 
with which they recalled more successes, more 
failures, or an equal number of each is refuted 
by Table 2. Chi square for the distribution of 
Ss falling into each category was clearly not 
significant. 

Comparison of the groups high (H-L, H-H) 
and low (L-H, L-L) in subjective evaluation 
reveals no differences on any of the recall 
measures, nor does comparison of the groups 
high (H-H, L-H) and low (L-L, H-L) on 
ratings of self-esteem behavior. The present 
findings, then, do not support the view that 
selective recall is related to subjective or 
behavioral indices of self-esteem, nor do 
children representing different types of self- 
esteem differ in their recall patterns. These 
results are consistent with previous studies 
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TABLE 3 


CORRELATIONS OF MEASURES RELATED TO 
RECALL AND REPETITION 








No. of Suc- No. of Suc- 
cesses Re- cesses Re- 
called peated 


Variable 


Number of successes recalled — -. 
Self-Evaluation (SEI) 
Behavior Evaluation (BRF) 
Objective Achievement (Iowa) 
D(SB)* 
Manifest Anxiety , 
Ideal Self .02 
Need Achievement .12 
D(SI) adjusted” .16 
Note.—Value of r for N = 48, .05 level = .28, .01 level = .37. 
* Difference between Self and Behavioral Evaluations. 
> Difference between Self and Ideal Self, adjusted for “‘ceiling” 
effect. 


(Caron & Wallach, 1957; Glixman, 1949; 
Smock, 1957) in the finding that there is a 
greater recall of successes than failures under 
stressful conditions. 

For task repetition, the dependent variable 
is the total number of success tasks repeated. 
Table 2 indicates that Ss in the Low-High and 
High-Low groups tended to repeat failures 
while Low-Lows and High-Highs repeated 
successes. Thus, groups showing agreement 
between SEI and BRF scores repeated twice 
as many successes as failures, while cases of 
disagreement repeated failures twice as often 
as successes. The comparison of groups high 
and low in self-evaluation, and the groups 
high and low on their behavioral ratings show 
no differences in repetition. 

In the course of preparing the profiles of the 
self-esteem types (Coopersmith, 1959) infor- 
mation on several personality and motivational 
variables had been obtained. A correlational 
analysis was performed to determine the 
nature and extent of the relationships between 
these measures and the recall and repetition 
of successes. The results of this analysis are 
presented in Table 3. The analysis includes 
data based on three scores derived from 
differences between measures. The first of 
these is the D(SB) score, based on the differ- 
ence between subjective and behavioral evalu- 
ations. Since the groups with high positive 
(H-L) and high negative (L-H) D(SB) scores 
behaved in identical fashion in the repetition 
experiment, the direction of the discrepancy 
will not be considered in this analysis. The 
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D(SI) and the D(SI) adjusted scores are 
based upon the difference between subjective 
and ideal self-statements, with the latter score 
adjusted to correct for the ceiling imposed on 
the D(SI) score by persons high in their self 
evaluation. Since the correlation between the 
two D(SI) scores was significant (r = .64, 
p<.01) only the adjusted score is included. 

None of the variables in Table 3 appear to 
be significantly related to the number of 
successes recalled. The only correlation ap- 
proaching significance was between the number 
of successes recalled and number of successes 
repeated (rf = —.26, p<.08). Of the four 
other variables related to the repetition of 
successes, achievement motivation was clearly 
significant (p<.01), while the D(SB) score 
(p<.06), self evaluation (p<.09), and ob- 
jective achievement (p<.10) approached ac- 
cepted significance levels. Partial correlations 
indicate that n Achievement and the D(SB) 
score, the two best predictors, are each inde- 
pendently related to repetition behavior. The 
beta coefficients of the multiple correlation of 
repetition with the n Achievement, D(SB), and 
SEI scores combined (r = .49, p<.01), signify 
that these variables contribute respectively 
45%, 34%, and 21% of the relationship. The 
coefficient of multiple determination (R? = 
.2349) indicates that these variables combined 
can account for 23% of the variation in repe- 
tition. 


Achievement Motivation 


The finding. that the two self-esteem types 
(High-Low and Low-High) that are high in 
achievement motivation but significantly dif- 
ferent in other characteristics, both repeat 
failures, raises the question of whether their 
actions are based upon similar underlying 
motives or upon different motives which 
produce the same effect. A number of studies 
(Alper, 1957; Atkinson, 1953; McClelland et 
al., 1952) indicate that the total n Achievement 
score includes a positive aspect indicative of 
an approach orientation and a motivation to 
overcome obstacles, and a negative aspect 
oriented toward avoiding failure. On the basis 
of the curvilinear relationships found in studies 
of perception (McClelland et al., 1952) and 
recall (Atkinson, 1953), it has been proposed 
(McClelland et al., 1952) that negative re- 
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sponses occur with greater relative frequency 
in the middle and lower portions of the distri- 
bution of total n Achievement scores. 

Examination of Table 4 indicates that there 
are significant difierences between the groups 
in positive and negative subcategories as well 
as in the general categories. While all groups 
have more responses in the positive than in the 
negative categories, the Negative/Positive 
ratio indicates that the High-Highs have the 
highest proportion of positive responses (and 
are thus presumably approach-oriented), the 
Low-Lows the highest proportion of negative 
responses (and are thus presumably avoidant- 
oriented), while the two groups showing 
discrepancies between self-ratings and be- 
havior have more of a balance between positive 
and negative categories, a situation which 
would presumably lead to conflict. These 
findings are consistent with the personality 
characteristics and dynamics of the self-esteem 
types (Coopersmith, 1959) and support the 
belief that they represent distinct character 
structures. 

Inspection of Table 4 further reveals that 
of the two groups which are low in achievement 
motivation the Low-Low self-esteem group 
shows the greatest percentage of negative 
responses, whereas the High-Highs show the 
greatest percentage of positive responses. The 
findings indicate that the positive and negative 


TABLE 4 


Torat NuMBER OF RESPONSES OF THE 
Setr-Esteem Groups in GENERAL 
CATEGORIES AND POSITIVE AND 

NEGATIVE SUBSCORES 


General 
Categories 


Total n Ach 
and Subscores 





‘a Neg./ 
| | Pos. 
AI* | TI® | ur 


Neg- |Posi- 
ative| tive 
Low-Low 22 | 15 | il 16 
Low-High 33 | 5} 10 26 
High-Low 23 | 12 | 13 21 | 34| .62 
High-High 19 | 12 | 17 | 8| 32] .25 


* . * 














*p < 05 

** < O01. 

* Achievement Imagery—Indicated by long term involve- 
ment, unique accomplishment, or competition with a standard of 
excellence. 

> Task Imagery—Some reference to achievement but not ex- 
plicit in terms of AI criteria 

* Unrelated Imagery—No references to achievement. 


STANLEY COOPERSMITH 


TABLE 5 


POSITIVE AND NEGATIVE SUBCATEGORIES OF 
ACHIEVEMENT MOTIVATION BY 
Setr-Esteem Groups 





Tot. 
I+ Gat G+ AchTh pot 





Positive 








Low-Low 9 
Low-High 14 14 
High-Low 12 il 
High-High 10 


Ga- 





Negative I- 





Low-Low 
Low-High 
High-Low 
High-High 
* 9 < 0S. 
"> < 01. 
Note.—Key to subcategories: 
Positive 
I+ Successful instrumental! activity 
Ga+ Anticipation of success 
G+ Positive affect related to achievement 
Ach Th Achievement imagery central theme 
Negative 
I— Unsuccessful instrumental activity 
Ga— Anticipation of failure 
G— Negative affect related to achievement 
Bw Environmental obstacle 
Bp Personal obstacle 


categories may vary independently of each 
other and of the total n Achievement score. 

With respect to responses within the sub- 
categories, Table 5 indicates that the major 
difference for the positive subcategory is in 
Achievement Thema responses while that of 
the negative aspect is in Bw responses repre- 
senting impersonal obstacles to achievement. 
The difference in the negative category remains 
significant when the Bw responses are removed, 
whereas the difference in the positive category 
no longer holds with the removal of Achieve- 
ment Thema responses. 

In order to take account of the differences 
between the groups in Achievement related 
stories, the total positive and negative cate- 
gories as well as the specific responses for each 
S were divided by the number of his AI stories. 
This analysis indicated significant differences 
for only the total negative category and the 
Achievement Thema and Bw responses. 

Correlational analysis of the achievement 
motivation subcategories with recall and repe- 
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tition revealed a significant negative relation- 
ship between total positive responses and the 
number of successes repeated (rf = —.36, 
p<.01). The correlation for the total positive 
alone (rf = —.36) is similar to that attained 
for the total n Achievement score (r = —.38). 
None of the other correlations between cate- 
gories and responses and the experimental 
behaviors, whether by type or for the total 
group, achieved accepted levels of significance, 
thus raising the possibility that where differ- 
ences were obtained they could be accounted 
for by chance. A further limit on interpreting 
the relationship between n Achievement and 
repetition is set by the recognition that the 
groups which were high in their D(SB) scores 
were also high in n Achievement, thus raising 
the possibility that the findings could be due 
to the effects of either variable, of neither, or 
of yet another variable as yet unidentified. 
Interpretation must therefore await experi- 
mental evidence indicating the significance of 
the self-behavioral discrepancy, and the inde- 
pendent effects of the n Achievement and 
D(SB) variables. 


DISCUSSION 


The results of the present study clearly do 
not support the hypothesis that either achieve- 
ment motivation or self-esteem are related to 
recall preference. The general result indicating 
a greater recall of successes under stress is in 
agreement with the results of previous studies 
(Glixman, 1949; Rosenzweig, 1943; Smock, 
1957), despite a bias introduced in the sample 
selection. Of those cases where a greater recall 
of failures under stress has been reported, only 
one (Atkinson, 1953) indicates an absolute 
majority of failures, while the others refer to 
either the recall pattern of particular indi- 
viduals (Alper, 1946; Atkinson, 1953; Caron 
& Wallach, 1957; Eriksen, 1952) or to an 
increase relative to some previous finding 
(Alper, 1948; Glixman, 1949; Rosenzweig, 
1943; Smock, 1957). Thus, while the suggested 
relationships of this study are not borne out, 
the group results support the general finding 
of a recall bias favoring success. 

It is this very absence of relationships be- 
tween the battery of variables and recall, as 
well as the small proportion of individuals in 
the sample who recall more failures, which 
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raises serious yet intriguing questions. Turning 
first to situational and task variables for an 
explanation, it may be argued that Ss in the 
present experiment may have been placed 
under greater stress than is generally the case, 
and since one of the effects of stress is to de- 
crease the number of failures recalled (Glixman 
1949; Rosenzweig, 1943; Smock, 1957) super- 
ordinate stress should presumably accentuate 
this trend. While this may have been the case, 
E’s observations and the comments by the Ss 
did not indicate an arousal greater than that 
reported by others investigators. In addition, 
recall experiments differ on such variables as 
the number of tasks used (4-30), the method 
of administration (group or individual), the 
type of task (scrambled sentences, drawing, 
puzzles, problems, etc.), the length of tasks, 
the length of time between performance and 
recall, and whether or not a “filler” activity 
is introduced. 

A further reason that may be advanced is 
that the use of alternates of the same task 
may have acted as a mnemonic and thereby 
confounded the recall pattern by increasing it 
in toto. Here the matter would seem to be one 
of strategy since it is clearly necessary to 
control for the nature and attractiveness of 
the task, and while each of the tasks may well 
act as a mnemonic for the other, procedures 
which do not control this variable are subject 
to the contrary rejoinder. While these differ- 
ences make comparisons difficult, it should be 
noted that the procedures employed in the 
present study are essentially similar to those 
employed in studies reporting a much larger 
percentage of Ss recalling more failures than 
successes (Alper, 1946; 1957; Caron & Wallach, 
1957; Glixman, 1949; Jourard, 1954; Tamkin, 
1957). 

The most likely explanation for the present 
findings appears to be that the Ss were younger 
than those generally employed in recall studies. 
It is clearly not the passage of time but rather 
the quantity of experiences as well as their 
quality (success and failure) which is pre- 
sumably related to recall preference. The two 
characteristics linked to maturity that appear 
to be related to recall are: (a) the development 
of a tolerance for the anxiety aroused by 
failure and a perspective upon its significance, 
an ability subsumed under such terms as ego 
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strength and self-esteem and (6) an attitude 
towards failure as frustrating and insulting 
which results in behavior designed to change 
it to success, a measure of which is apparently 
obtained in the achievement motive. Persons 
high in both these characteristics should recall 
failures under stress and nonstress, while those 
low in both these characteristics should recall 
successes under both conditions. Persons with 
intermediate amounts of the two, in varying 
combinations, may be expected to recall either 
success or failure, depending on the extent to 
which the situation is perceived as failure and 
the extent to which n Achievement is aroused 
by situational cues. 

A good deal of the controversy as to whether 
the recall of failures is indicative of a weak 
(Alper, 1946; Rosenzweig & Mason, 1934) or 
strong ego (Sanford, 1946) appears due to a 
failure to distinguish between the concepts of 
appropriateness and strength. Appropriateness 
refers to a response suitable to a particular 
situation and may be acquired through rela- 
tively superficial social learning, while psycho- 
logical strength refers to the ability to tolerate 
adversity and less need to distort and restrict 
reality, thus increasing the probability that 
adversity will be overcome. The former is 
based upon familiarity with a given environ- 
ment and perceptual sensitivity to its cues, 
while the latter is achieved by states of high 
motivation, a wide range of experiences, and 
the confidence gained in overcoming adversity. 
The two clearly are not synonymous and it is 
plainly not possible to reason from one to the 
other. 

Failure to distinguish between the appropri- 
ateness and strength in the case of recall 
leads to the position that the distortion and 
restriction of reality which results from pro- 
tecting self-esteem is an indication of greater 
strength than accurate perception and a 
corrective response. It also leads to the con- 
clusion that children have stronger egos than 
do adults, a conclusion clearly unwarranted 
by current theory or evidence. A more 
meaningful interpretation would appear to be 
that adults have learned to act more ap- 
propriately than children. Accordingly, they 
may be expected to report responses publicly 
in accord with group expectations, ones that 
do not necessarily correspond to the total or 
most vivid recall experienced. 
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In evaluating the hypothesis (Caron & 
Wallach, 1957; Osgood, 1953) that recall is a 
rehearsal of future behavior with a view 
towards commitment, it is necessary to 
distinguish between those cases of repetition 
in which the S is confronted with the various 
tasks and asked to choose between them, and 
those in which the repetition choices must 
come from the tasks which he has recalled. 
The former case allows a wider psychological 
world to operate than does the latter, which 
may provide alternates for repeating none of 
the variables, only one, or a choice among 
several variations. It is obvious that if the 
person is motivated to overcome a failure 
experience but cannot recall it, repetition is an 
impossibility unless he is confronted by it. 
Since the procedures for aided and unaided 
repetition may produce different results and 
the present test of the rehearsal hypothesis is 
limited to the condition of aided repetition, it 
may be suggested, in view of the relatively 
low correlation (r = .26) between recall and 
repetition, that if recall is a rehearsal, it is not 
one that frequently results in performance. 
Since the results indicate that more successes 
are recalled than failures, unaided repetition, 
i.c., that based solely on the recall of the S, 
would appear to be more likely to result in a 
greater repetition of successes than would 
aided repetition. 

It should be borne in mind that aided 
repetition is more anxiety provoking since it 
forces upon the individual the recognition that 
he has failed, something which can be denied 
when the defenses are allowed full sway. In 
terms of failure repetition, which would appear 
under all circumstances to cope more ef- 
fectively with reality, the Low-High and the 
High-Low groups apparently are more capable 
and willing to undergo anxiety in order to 
attain success than are the High-High and 
Low-Low Ss who continue to repeat successes 
even when confronted with failures. 


SUMMARY 


This study investigates the relationship be- 
tween self-esteem and the recall and repetition 
of success and failure experiences. Four groups, 
of 12 Ss each, were selected on the basis of 
their self-evaluative responses and an evalu- 
ation of their self-esteem behaviors. The 
combinations of these two variables, at their 
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extremes, yielded significantly different 
patterns on such variables as achievement, 
ideal self, and sociometric status, and ap- 
parently represent distinct types of self-esteem. 

Each S was given a series of six pairs of 
tasks, in an achievement oriented situation. 
The S failed one of each pair of tasks and 
succeeded on the other. The tasks were fol- 
lowed by four story completions which had 
been found to elicit n Achievement. Recall was 
then obtained, after which the S was offered 
an opportunity to repeat either the success or 
failure variation of each task. 

The data were analyzed both in terms of the 
tour groups differing in self-esteem and by a 
correlational analysis to determine which 
variables were related to recall and repetition. 
All groups recalled more successes, and none 
of the independent variables was significantly 
related to recall. Groups in which self-rated 
self-esteem was discrepant from behavior rat- 
ings tended to repeat failures; those in which 
the two measures agreed tended to repeat suc- 
cesses. An, index of agreement between self- 
rated self-esteem and behavioral self-esteem 
and n Achitvement were the variables most sig- 
nificantly related to repetition. Analysis of the 
need achievement scores indicated differences 
between the self-esteem types in positive, 
negative, and general categories as well as 
subcategory responses, although none of these 
were related to the experimental behaviors. 

It is proposed that two factors are necessary 
for the recall and repetition of failure. These 
are the ability to tolerate failure and the 
motivation and striving to overcome its 
effects. 
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TUDIES of cooperation in the laboratory 
have been stimulated, in part, by the 
large number of life situations in which 

cooperative action is observed both to occur 
and to break down. Biologists and ethologists 
(Lindzey, 1954, pp. 540-541; Tinbergen, 1953) 
have isolated and analyzed a number of ex- 
amples of cooperative behavior among the 
lower organisms. These relationships, termed 
symbiotic or commensal, have frequently been 
noted in dyads for feeding or protective pur- 
poses and are often composed of members from 
different species. Elaborate systems of coopera- 
tive behavior as weil as breakdowns in these 
systems also pervade human activity (Lindzey, 
1954, p. 754; Mintz, 1951). Many cooperative 
situations are difficult to analyze with cur- 
rently available techniques and theory. The 
abstraction and isolation of variables in the 
laboratory should enhance our understanding 
of these complex phenomena. The purpose of 
this study is to describe a number of reinforce- 
ment parameters in terms appropriate to 
dyadic cooperation and to study these param- 
eters experimentally. The choice of reinforce- 
ment parameters for the present study is 
based upon the general conception that the 
reinforcing events are critical components in 
the development and maintenance of coop- 
erative action. 

Even a cursory review of the literature on 
cooperation reveals a number of different 
usages of the concept (Deutsch, 1949). To 
avoid ambiguity, a definition of a cooperative 
situation is adopted for this study that does 
not attempt to include all possible uses of the 
term, but does share an area of meaning with 
other definitions in the literature and is 
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intended to be applicable to a variety of ex- 
perimental conditions. The definition is formu- 
lated for dyads but extensions to larger groups 
are easily made. A cooperative situation has 
all of the following four characteristics: (¢) 
There exist at least two outcomes for each 
individual following a response. (b) At least 
one of these outcomes is rewarding (i.e., it 
provides positive reinforcement and/or termi- 
nates noxious stimulation), and at least one 
outcome is not rewarding (or less rewarding). 
(c) Reward delivered to one person must not 
preclude its availability for the other. In this 
study a reward was available to both persons 
simultaneously during each trial. A coopera- 
tive situation may be contrived, however, in 
which reward is deliverable to only one or the 
other person in any one trial or time period. 
If the study consists of onty one trial, the 
latter situation would be termed competitive 
rather than cooperative. (d) The outcomes to 
both persons are, in part or entirely, a function 
of the response of the other person. That is, 
the outcome to Person X must depend to some 
extent upon the behavior of Person Y and 
the outcome to Person Y must depend to some 
extent upon the behavior of Person X. If the 
outcome to only one is dependent on the 
behavior of the other, it is only a “partially 
cooperative situation.” It resembles a com- 
mensal rather than a symbiotic relationship. 
There may, however, be additional asocial 
determinants for the outcomes in a coopera- 
tive situation, e.g., a random component. 
Cognitive concepts are not included in this 
definition of a cooperative situation. “Aware- 
ness,”’ defined either in terms of experimenter 
operations (e.g., instructions to the S that it 
is a “cooperative situation”) or in terms of the 
S’s behavior, (e.g., questionnaire to the S to 
determine his verbalizations about the co- 
operative aspects of the situation) is not a 
necessary condition for the specification of a 
cooperative situation. Nor is it always a neces- 
sary condition for the development and main- 
tenance of cooperative action (Hall, 1957; 
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Rosenberg & Hall, 1958; Sidowski, Wyckoff, & 
Tabory, 1956; Sidowski, 1957). That is, the 
consistent occurrence of response combinations 
that produce optimally rewarding outcomes 
for both persons does not require awareness in 
either sense of the term. The exclusion of 
awareness as a necessary condition for the 
definition of a cooperative situation, however, 
implies nothing about its relevance to the 
behaviors that take place in some cooperative 
situations. 

The experimental situation adopted in this 
study has been used in a number of previous 
studies (Hall, 1957; Rosenberg, 1959; Rosen- 
berg & Hall, 1958). While these studies were 
concerned primarily with the development and 
maintenance of precise motor activity in a 
team setting, the task and procedures were 
also found to be quite applicable to the prob- 
lems discussed in this paper. Each person was 
given a series of trials in which he performed 
the simple motor response of turning a small 
concealed knob. It is a trial-and-error response 
in that the only discriminative stimulus for 
each trial was a signal to respond. The rein- 
forcing stimu!us was delivered to each S after 
each trial. Selected parameters of these rein- 
forcing stimuli were considered important to 
the development and stability of cooperative 
action. These parameters were varied experi- 
mentally in a series of three experiments. The 
first experiment deals exclusively with coopera- 
tive situations. The other two experiments 
contain some conditions that are only partially 
cooperative. All conditions and every trial, 
however, involved a combination of outcomes 
that was positiveiy reinforcing to both Ss. 
The degree to which Ss were able to learn to 
procure these simultaneous reinforcements, 
i.e., develop cooperative action, was studied. 

The task used in these studies restricts and 
simplifies the interpersonal actions that occur 
to a much greater degree than has been cus- 
tomary in social psychology. Special emphasis 
has been placed on controlling the way in 
which one person’s responses may affect 
another’s stimulation. The control of these 
linkages appears to be central to extensions 
of S-R descriptions to interpersonal events, 
at least when viewed from our present state 
of understanding. Similar attempts to extend 
S-R theory have been expressed in the experi- 
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mental studies of Sidowski et al. (1956) and 
Sidowski (1957) in their use of a “minimal 
social situation” and in earlier discussions by 
Miller and Dollard (1941), and by Logan 
(1955). 


METHOD 
Apparatus 

Two identical adjacent booths were provided, one 
for each S. The walls of the booth were sufficiently 
high so that Ss could not see each other. One wall of 
each booth consisted of a panel from which a small 
cylindrical knob projected about 4% in. Each S was 
provided with his own panel and knob. The knob was 
mounted about 3 ft. above floor level and was concealed 
by a wooden shield. The S could turn the knob by 
reaching under the shield but had no visual access 
to the knob. The knob was homogeneously knurled on 
its entire surface to avoid giving tactual cues of the 
amount it was turned. The £, seated on the other side 
of the panel, could read from a dial the amount each 
S turned the knob on each trial. 

On the panel in front of each S, about a foot above 
the shielded knob, two or three 6-w. lights (depending 
upon the experimental condition) were mounted about 
1.5 in. apart and labeled so as to provide a reinforcing 
stimulus after each trial. 

The knob apparatus is essentially the same as that 
described in a previous publication (Rosenberg & Hall, 
1958). In this study, however, reinforcing stimulus 
values were two- or three-valued rather than scalar 
quantities. 


Subjects 


Ss were male Air Force basic trainees whose modal 
age was about 17 years. Trainees were assigned as a 
unit for use as Ss in psychological research but were 
excluded form the experiment if their Armed Forces 
Qualification Test (AFQT) score was below the 50th 
percentile rank, which is roughly equivalent to a 
Stanford Binet IQ of 100 (Gragg & Douglass, 1956). 
Ss were used in pairs, randomly assembled. Twenty 
pairs were used for Experiment 1, 60 pairs in Experi- 
ment 2, and 36 pairs in Experiment 3. 


General Procedure* 


Ss in all experiments and in all experimental con- 
ditions were given the same instructions followed by 
50 trials. Instructions were tape-recorded and heard 
by Ss through earphones. An E pointed to and operated 
appropriate parts of the panel when they were men- 
tioned in the instructions. Each S was told: 

The Air Force needs to know more about how well 
people can do certain tasks. The experiment in 
which you are to take part is a study of just one 


*The author wishes to acknowledge the able as- 
sistance of Richard W. Hall and Stuart Dobson in the 
conduct of the experiments. Their dedicated efforts 
made it possible to complete this series of experiments. 
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such task. It is a simple task of turning a knob the 

right amount. Listen carefully to these instructions 

and the experimenter will point out things as they 

are mentioned. On the panel in front of you is a 

shield, and underneath the shield where you cannot 

see it, is a small knob. You may look under the shield 
now to locate the knob, but you are expected to work 
without seeing the knob. Your job is to iearn to turn 
the knob just the right amount. You will have a number 
of chances to do this. The green signal light will tell you 
when to start and stop. When the green light comes on, 
you begin to make your selling. As soon as you have 
made the selting you think is correct, take your hand 
away from the knob. When the green light goes off you 
must take your hand away whether you are finished or 
not. You musi turn your knob each time the green light 
comes on. How much you turn it is what you have to 
learn. You will find that you have plenty of time without 
rushing. A few seconds after you have made your setting, 
you will be shown how well you did by the signal lights 
mounted just above the shield. Always turn your knob 
to the right as indicated by the sign tacked just over the 
shield. After each trial your knob is set back by the 
experimenter so that each time you make a setling you 

are starting from zero. [long pause) Because it is im- 

portant that you understand these instructions, I will 

repeat the most important parts. [Italicized portion 
repeated here.] Now we are ready to start. Do not 
touch your knob until the green signal light comes on. 

Now remove your earphones and watch for the green 

light to come on. 

One complete trial consisted of three automatically 
timed intervals: (a) a response period of 8 sec. indicated 
to the S by a green signal light during which the knob 
could be turned; (6) a delay of 10-12 sec. during which 
E recorded the response magnitude of each S in the 
dyad, set these values into a computer, and read from 
the computer the values of reinforcement to deliver 
to each S; (c) the presentation of reinforcement by 
turning on the appropriate lights of S’s panels for 2 
sec. After 2-3 sec. delay a new trial was started. 

The presence of both Ss in the same room did not 
appear to suggest to any of the Ss that a behavioral 
link might exist between their tasks. A sizable number 
of Ss made spontaneous comments about the task and 
“how it works” at the end of the session but none 
commented about the possibility of an interpersonal 
linkage. The nature and presentation of the instruc- 
tions, the use of separate, completely enclosed booths 
for each S, the absence of an apparent mechanical 
linkage between the apparatus in the two booths all 
seemed to create the set for each S that he was being 
studied as an individual, albeit simultaneously with 
another person. 


EXPERIMENT 1 
Experimental Conditions 


Each member of the dyad was assigned to 
one of five social reinforcement conditions. 
These five conditions differed from each other 
in that a different functional relationship was 
used to determine the St (reinforcement stimu- 
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lus) to an S from the response magnitudes of 
both S, and S,. 

To understand the derivation of the five 
reinforcement conditions, it is first necessary 
to define a variable Z. The five conditions were 
generated by assigning one of five values to 
the parameter, a, or a,, in the following 
equations 


Zz = a,R, + (1 — a,)R, 
Z, (1 — a,) R. + aR, 


where 1 > a > 0, and where R, and R, are 
the numbers of turns by each S on a particular 
trial. Note that the Z function is a simple 
weighted average of the two response magni- 
tudes obtained on each trial and that the value 
of a is the proportion of variability in the Z 
values of an S that is attributable to his own 
response. If a, were 1.00, for example, the 
situation would be reduced to individual trial- 
and-error learning for S,. He would determine 
all of the variability of the Z values. All other 
values of a would be considered “social.” 
The numerical values of Z, and Z, were not 
the actual reinforcement stimuli delivered to 
S, and S,, respectively. The numerical value 
of Z on a trial determined which one of the 
following .three S* events were presented to an 
S on that particular trial: 
when Z < 3.6 turns 


when Z = 40+ < .4 turns 
when Z > 4.4 turns 


1. Poor—Too Little 
2. Good 
3. Poor—Too Much 


This is essentially a coarse grouping of Z 
values, in which weighted average values 
below or above a certain range are Poor and 
with further specification as to direction. The 
same range of Z, 4.0 + <.4 was used to trans- 
form both Z, and Z,. 

When the parameters of both functions, 
Z, and Z,, have been specified numerically, 
and the relationship between Z values and S* 
events stated, an “interpersonal reward sys- 
tem” is said to have been constructed.’ In 
Experiment 1, five interpersonal reward sys- 
tems were constructed by using the five a 
values: .55, .525, .50, .475, and .45, and im- 


*Note that the term “reinforcement condition” 
refers to the operations by which an S* to a single S is 
determined and that “interpersonal reward system” 
refers to the combination of “reinforcement conditions” 
for two or more persons. This terminology is used 
consistently throughout the paper. 
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Fic. 1. Graphic summary of the cooperative reward 
systems used in the three experiments. The circled 
numbers represent the combinations of conditions for 
S, and S, in each experiment. The circled Digit 1 
represents the combinations in Experiment 1, the Digit 
2 in Experiment 2, and the Digit 3 in Experiment 3. 
The other half of the square would be a repetition of the 
same combinations with the conditions reversed for 
S, and Sy. 


posing the restriction that a, = a, for each 
reward system. Thus, Experiment 1 was 
limited to reward systems in which each mem- 
ber of the dyad received the same proportion 
of his own response in his reinforcement. That 
is, when a, was .525, for example, a, was also 
525. This should be carefully distinguished 
from a system in which both members receive 
the same reinforcement values, (e.g., @. = 
525, a, = .475). Figure 1 is a graphic summary 
of the combinations of a, and a, chosen in 
Experiments 1, 2, and 3. The five combina- 
tions of ¢, and a, used in Experiment 1 are 
indicated by the figure 1 in a circle. The ra- 
tionale for choosing these particular functions 
was derived from previous study. Previous 
data (Rosenberg, 1959) suggested that these 
five points represent the critical range of 
the continuum from those systems in which 
procurement and maintenance of reward 
develop for both Ss to those systems in which 
continuous reward failed to develop for at 
least one of the two Ss. 

It should be noted that all five reward sys- 
tems in Experiment 1 satisfy the definition of 
a cooperative situation on every trial. That is, 
there exist at least two outcomes for each S 
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following a response, at least one is rewarding 
(Good) and at least one is not (Poor), reward 
to one S does not preclude its availability for 
the other, and, finally, the @ values chosen 
determine that the outcomes to both Ss are, 
in part or entirely, a function of the response 
of the other S. In fact, all combinations of 
a, and a, in Fig. 1, except where a, or a, equals 
1.00, are cooperative interpersonal reward 
systems. 


Results and Discussion 


Figure 2 summarizes the essential findings 
of Experiment 1. Careful study of this figure 
will clarify the experimental conditions and 
the findings considerably. Results of subse- 
quent experiments are presented in a similar 
manner. 

The columns of small graphs represent the 
first two and last two successive blocks of five 
trials, as labeled across the top. These trial 
blocks seemed sufficient to demonstrate both 
(a) the direction of response changes during 
the 50 trials, and (6) the stability of the re- 
sponse values at the end of the experiment 
(comparisons of Trial Block 41-45 with 46- 


50). Each row of graphs represents one of the 
five reinforcement systems and is labeled in 
the first graph of each row. For example, S* 
.525-.525 in the second row indicates that 
reinforcing stimuli were based upon Z, = 


525 R, + .475 R, and Z, = 475 R, + 
525 R,. The two coordinates, x and y, for 
each point plotted are based upon the averages 
of response magnitudes for five trials performed 
by S, and S,, respectively. A symbol shape 
(e.g., triangle, circle) is used consistently 
within each row to represent measurements 
from the same dyad, but similar symbol 
shapes, among different rows, refer to different 
dyads. A single symbol always represents 
responses made by the two members of a 
single dyad, never an average of several dyads. 

Within a graph, the two pairs of parallel 
lines were derived from the equations for Z, 
and Z,. The area between a set of parallel lines 
is the Good range for either S, or Sy. The area 
below and/or to the left of a set of parallel 
lines is the Poor—Too Little range and the 
area above and/or to the right is the Poor— 
Too Much range for an S. The combinations 
of these St events for each subarea are shown 
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Fic. 2. Summary of the results of Experiment 1. The first two and last two successive blocks of five trials 
are shown as colurnns of small graphs and the five reinforcement systems as rows of small graphs. A symbol shape 
(e.g., triangle, circle) within each row represents the same dyad, but similar shapes among different rows refer to 
different dyads. 





COOPERATIVE BEHAVIOR IN Dyaps 


in the first graph of a row. For example, if the 
two response magnitudes were such as to fall 
anywhere in the (+, +) area, both Ss were 
told Poor—Too Much; or, if the two responses 
fell into the (G, —) area, S, was told Good 
and S, was told Poor—Too Little; etc. 

The shaded areas in the first column of 
graphs are the areas in which it is predicted 
that a pair of .esponses would stabilize over a 
series of trials and are termed “absorbing 
areas.” The presence of two absorbing areas 
within a reward system indicates a prediction 
of response combinations in one or the other 
area, and not that there will be oscillation 
between these areas. 

The absorbing areas are derived from the 
following simple assumptions: 

1. An S will tend to decrease his response 
magnitude following a trial in which he is told 
Too Much and increase it if he is told Too 
Little. (This assumption fails to hold, of course, 
at the extremes, i.e., either when an S is giving 
an extremely small response and is told Too 
Much, or when he is giving a large response 
approaching the limit of his speed of turning 
and is told Too Little.) 

2. If S is told Good the change in his re- 
sponse magnitude on the following trial is 
smaller than changes with the other two feed- 
back stimuli (but not zero). (It is important 
to note that over a series of Good trials, these 
small response changes would not vary around 
a constant magnitude. Changes would either 
occur according to a “random walk” and/or 
tend to drift in the direction of least effort.) 

3. The directions of the response changes 
by S, and S, following a given S* are assumed 
to be additive and only the directions of 
response change are required to predict ab- 
sorbing area(s). The rate of response changes 
is ignored in the summations to determine 
these areas. (Where two areas are predicted or 
where the predicted area is large, differential 
initial rates of response change probably de- 
termine which of the two areas or which por- 
tion of a single area will be the absorbing one. 
If one is interested in the proportion of trials 
spent in each area, response rates are also 
important.) 

With these assumptions, a vector of response 
change may be drawn for each area within a 
graph. These vectors are all either vertical, 
horizontal, or at a 45° angle. A few examples 


are: 


” 


’ 
G,- Tf 
+,Ge 
TT, T. < 
G, G — 

Having drawn vectors in all the subareas, 
one can predict the resultant movements of a 
combination of responses over a series of trials. 
In the first three rows in Fig. 2, the functions 
of Z generate a set of vectors converging on 
the (G, G) area from ali paris of all other 
subareas. A response combination occurring in 
any subarea other than (G, G) would be fol- 
lowed by an S’ combination tending to change 
the responses toward the (G, G) area only. 
In the bottom two rows, however, vectors in 
two subareas, (G, +) and (+, G), do not 
point exclusively to the (G, G) area. That is, 
the S* combination in a small portion of each 
of these two subareas is expected to change the 
response magnitude of one S$ but the change 
should still maintain the response combina- 
tion in that subarea. Even if response combina- 
tions do occur in the (G, G) area during the 
process, these combinations are not stable 
over time (Assumption 2, above) unless all 
parts of all subareas lead to the (G, G) area. 
The ultimate absorbing areas in the bottom 
two rows are, therefore, (G, +) and (+, G) 
subareas and not the (G, G) area. The deeper 
shading within these two absorbing areas 
represents a prediction of responses to that 
part of the absorbing area, i.e., where one S 
is performing extremely small responses and 
the other S is performing responses somewhere 
within the range designated by the deeper 
shading. This area of response combinations 
is termed, for convenience, an “absorbing 
wall.” The absorbing wall exists because there 
is no provision for “negative responses.” In 
the case of this study, no turns of the knob in 
the opposite direction are possible. If this 
direction of turning were permitted, the pre- 
diction for these two reward systems would 
not be of the same areas but would be of areas 
with a similar consequence, ie., a failure in 
the optimal development of cooperative action. 
The important point is not whether or not an 
absorbing wall exists, i.e., where one S can be 
told Too Much for very small response magni- 
tudes and cannot make negative responses, 
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but rather that there are parametric values of 
interpersonal reward systems that lead to the 
breakdown of optimal cooperative action. 

Other Z functions, outside the domain of 
the present study, may generate a set of vec- 
tors for which the resultant is not readily 
apparent and/or for which one would predict 
continuous movement from one subarea tio 
another rather than the presence of absorbing 
area(s) or wall(s).* 

Visual examination of the graphs shows 
striking accuracy of prediction for almost 
every dyad in every condition. The only 
reward system in which there appears to be any 
deviation from prediction is in St .475-.475 
(fourth row) and this can probably be attrib- 
uted to the relatively small absorbing area. 
Even here, however, there is no radical de- 
parture from prediction. The points are all in 
close proximity to the predicted areas. Com- 
parison of Trial Block 41-45 with Trial Block 
46-50 also shows, graphically, the high sta- 
bility over time of response pairs in all condi- 
tions after 40 trials. 

Two analyses were performed to establish 
statistical significance of these observations. 
The first tested the adequacy of the predictions 
over all conditions. The second analysis ex- 
amined the possibility of differentially effec- 
tive prediction among conditions. Both 
analyses were limited to data from the last 
10 trials. For each of the 20 dyads, a count was 
made of those trials in which the response 
combinations fell outside the predicted area. 
This number was subtracted from the number 
of trials in which prediction was verified. 
These 20 difference values (theoretical range: 
+10 to —10) had a mean of +5.20 and 
a of 5.60. The positive value for the mean 
indicated more hits than misses on the average. 
A t for differences against the null hypothesis 
that there were as many hits as misses yielded 

* Many of the two-person game matrices of current 
interest to probability learning theorists and to social 
psychologists appear to result in asymptotic response 
probabilities in a cell or “area” of less than 1.0, i.e., 
continuous movement from one cell to another. More 
quantitatively stated assumptions are required to 
predict these response probabilities for each area in such 
interpersonal reward systems (Atkinson & Suppes, 
1958; Estes, 1957). It is interesting to note that response 
sequences in these systems rarely receive analytic 
treatment. The present paper is an attempt in this 
direction. 
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a value of 3.883 (df 19), significant beyond the 
001 level.’ The second analysis was a simple 
F test (between and-within groups breakdown) 
in which the same difference scores were com- 
pared among the five systems. The null 
hypothesis was that each of the five conditions 
was equally predictable as measured by these 
difference scores. A qualitative comparison of 
the mean difference for each condition does 
suggest that S'* .475-.475 is somewhat less 
predictable than the others, but an F of 1.88 
(df 4, 15) is not sufficiently large to reject the 
null hypothesis at the .05 level. 

The statistical examinations of the data 
support the readily observable conclusion 
that availability of a cooperative situation 
does not assure optimal cooperative action, 
i.e., procurement of continuous positive rein- 
forcement for both Ss on every trial. If we 
view the five reward systems as points on a 
continuum, there appears to be a critical value 
below which optimal cooperative action fails 
to develop. In terms of Experiment 1, both 
Ss receive continuous positive reinforcement 
when the reinforcement value to each is based 
upon 50% or more of his own response magni- 
tude, ie., when a4, = a, > .50 in the two Z 
functions. Proportions less than 50% result 
in partial or no reinforcement to one S. Experi- 
ment 3 will demonstrate that a, = a, is not a 
necessary condition for either of these effects 
and will provide a generalization for all reward 
systems depicted by Fig. 1. 

Before additional empirical exploration, it 
is worth examining the features of these co- 
operative situations which covary with manip- 
ulations of the Z functions. It is obvious from 
Fig. 2, for example, that the relative sizes of 
the subareas vary from row to row. And in 
S* .50-.50 four subareas are completely absent. 
However, there is little reason to interpret the 
results in terms of available areas or their 
sizes relative to each other. The selection of 
systems above and below the middle row was 
such as to control for subarea sizes. Thus, for 
example, S* .55-.55 and S* .45-.45 have com- 
parable subareas and they are of the same size. 

It is possible to suggest other interpretations 


* This is a rather conservative estimate of the ac- 
curacy of the prediction since it fails to credit occur- 
rences in proximal areas, i.e., “near misses” and since 
predicted areas are very small relative to total area. 
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of the results in reward systems that fail to 
elicit optimal action. It may be that the 
absorbing areas are valid because of the im- 
probability that Ss ever encounter the (G, G) 
outcome. If they did, this interpretation would 
predict that their responses would stabilize in 
(G, G) rather than in (G, +) or (+, G). The 
data fail to support this interpretation. In S* 
45-45 and S* .475-.475 there is an average of 
7.0 and 19.8 (G, G) outcomes in the first 40 
trials. This fact is discernible to some extent in 
Fig. 2 for Trial Blocks 6-10. In addition, 
results of a previous study (Rosenberg, 1959) 
demonstrated very clearly that even if Ss are 
first individually trained to a point where they 
are both receiving continuous reinforcement, 
i.e., placed in the (G, G) area, the change to a 
cooperative condition results in gradual de- 
terioration of reinforcement if the relative 
response weighting is less than S* .50-.50. 

Finally, one may explain stability of a non- 
optimal absorbing wall in terms of partial rein- 
forcement for one S and continuous reinforce- 
ment for the second S. Since the (G, G) area 
is adjacent to the (G, +) and (+, G) areas, 
the (G, G) outcome may occur with sufficient 
frequency to reinforce S, or S, (as the case 
may be) for very small response magnitudes, 
and thereby produce relatively stable be- 
havior. This explanation was tested in Experi- 
ment 2 by studying more extreme conditions 
where partial reinforcement was less likely to 
occur, i.e., where the (G, G) area was not 
adjacent to (G, +) or (+, G) areas. This 
extension was combined with an analysis of a 
second parameter of the reinforcing stimulus, 
i.e., its informational ambiguity. 


EXPERIMENT 2 


In essence, the results of Experiment 1 
demonstrated that Ss who are placed in a 
reward system in which relatively few out- 
comes may be misleading fail to gravitate 
toward optimal cooperative behavior in spite 
of the presence of an optimal outcome and in 
spite of the occurrence of such an optimal 
outcome during the learning sequence. This 
finding suggests that a cooperative situation 
be constructed usingan unordered set of rein- 
forcing events, and hence, no “misleading” 
outcomes. Such unordered S* events are less 
explicit either because (@) they are only quali- 
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tatively discriminable to S, although outcomes 
do implicitly covary with the quantitative 
combinations of R’s or (6) unrewarding out- 
comes actually contain no information about 
what kind of response change is needed to 
bring about the rewarding outcome. The reduc- 
tion of explicit information by either alterna- 
tive may retard learning in cooperative situa- 
tions where explicit information would have 
resulted in rapid learning. On the other hand, 
an unordered set of outcomes may produce 
behavior that yields a larger proportion of 
rewarding outcomes in cooperative situations 
where more explicit information about Z 
values would be misleading. Thus, three sets 
of reinforcing events varying in informational 
content about Z were compared in Experi- 
ment 2. Information about the direction of 
nonreinforcement has been studied extensively 
in individual learning (Stevens, 1951, pp. 1267- 
1270), but has received scant attention as a 
parameter in interpersonal learning. Finally, 
a wider range of a values was studied in Experi- 
ment 2 in order to assess the limits of generality 
of results in Experiment 1. 


Experimental Conditions 


Each dyad was assigned to one of the five 
reward systems indicated in Fig. 1 by the 
number 2 in a circle, and to one of the three 
sets of reinforcing events. Thus, there were 15 
experimental conditions (5 X 3) with four 
dyads in each condition. 

In terms of the equations 


Z, = a,R. + (1 — a,)R, 
Z, = (1 — a,)R. + a,R, 


a, equalled a, within a dyad. As in Experi- 
ment 1, therefore, S, received the same pro- 
portion of R, as did S, of R, in each dyad. The 
five values of a were 1.00, .60, .50, .40, .00. 
It might be noted that the first combination 
of values of a (1.00, 1.00) is at an extreme which 
is completely asocial since neither S depends 
upon the other for his reinforcement. It is the 
only combination in Experiment 2 which fails 
to fulfill the definition of a cooperative situa- 
tion. 

The three sets of reinforcement events vary- 
ing in information about Z were: 
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Fic. 3. Summary of the results of Experiment 2 (see explanation of Fig. 2). The filled symbol shapes refer to 
dyads given explicit information about plus or minus values shown in each subarea of a graph; the open symbols 
with horizontal! bar refer to dyads given ambiguous information about the plus or minus values; the open symbols 
represent dyads given only “poor’’ or “good” as outcomes. 
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Poor—Too Much; Good; Poor—Too 
Little 

. Poor; Good; Poor 

. Poor; Good 


(MGL) 
(PGP) 
(PG) 


The first set is the same as that used in Experi- 
ment 1. In the second set, Too Much and Too 
Little were left off, but which of the two Poor 
outcome lights was given on a trial depended 
consistently on whether the Z value was above 
or below the Good range. In the third type, 
all Z values above and below the Good range 
were called Poor, and hence only two outcome 
lights were used, one for Good and one for 
Poor. 


Results and Discussion 


Figure 3 is a graphic summary of the results. 
Format is similar to that employed in Fig. 2. 
Additional notations were required to dis- 
tinguish the three sets of reinforcement stimuli. 
The solid black symbols represent the MGL 
set (used exclusively in Experiment 1), the 
symbols with horizontal bar represent the 
PGP set, and the open symbols represent the 
PG set of two stimuli. Use of the same symbols 
within a row indicates successive measure- 
ments from the same dyad while similar 
symbols in different rows refer to different 
dyads. 

Also, each subarea on the graph is labeled 
with two pairs of coordinates. Coordinates in 
parenthesis refer to the explicit or implicit 
values associated with the MGL and PGP 
conditions, respectively. Coordinates outside 
of parenthesis refer to a pair of outcomes in 
the PG condition. The shaded areas were 
derived from the same assumptions stated in 
Experiment 1, although the validity of these 
assumptions for the PGP set and certainly 
for the PG set is questionable. 

An analysis was first performed to ascertain 
the differential validity of these assumptions 
on the last 10 trials for all 15 conditions. A 
factorial design was employed containing two 
criteria of classification: (a) five reward sys- 
tems and (6) the three sets of reinforcement 
events varying in informational content. The 
dependent variable for each dyad was simply 
the number of trials between 41 and SO in 
which the response combination occurred in 
the preducted area minus the number of occur- 
rences in all other areas. The analysis is sum- 
marized in Table 1 and indicates a significant 


F only among the three reinforcement sets 
varying in information. The graphs in Fig. 3 
illustrate very clearly the nature of these 
differences. The assumptions for deriving the 
shaded area are least valid for the PG condi- 
tion. There are almost no dyads in this condi- 
tion whose behavior conform to prediction. 
The assmptions are more valid for the PGP 
conditions but still fall short of the very con- 
sistent findings for MGL condition. Deviation 
from prediction does occur in the MGL condi- 
tion also, particularly near the extreme where 
the @ values are small, i.e., where cooperative 
action is not predicted to develop optimally. 
In S* .00-.00, where it was predicted that 
neither S would receive reinforcement, the 
four dyads conform to prediction in the early 
trials, e.g., 6-10, but by the end of the experi- 
mental session two dyads have moved out of 
the predicted areas. Also in St .40-.40, one 
dyad changed responses so as to move out of 
the predicted area during the last five trials. 
These response changes out of predicted areas, 
interestingly enough, do not result in more 
effective cooperative action, at least during the 
number of trials studied. These findings for 
small @ values lend support to the hypothesis 
suggested in discussion of Experiment 1, i.e., 
that a nonoptimal absorbing area inay depend 
for its long-term stability on partial reinforce- 
ment of the small response magnitudes by 
one S. A significant interaction of reward 
system with information content, iie., 1 XK 2 
in Table 1, could support this interpretation 
more fully. The absence of interaction fails to 
refute, however, the generality of the assump- 
tions used to derive predicted subareas. More 
Ss and more trials may be required to refute 
the validity of the assumptions for extreme 


TABLE 1 


ANALYSIS OF VARIANCE OF THE Last 10 TRIALs OF 
EXPERIMENT 2 USING THE DiFreERENCE ScORES 
(FREQUENCY OF PrEepicTeED Argess Minus 
FREQUENCY OF OTHER AREAS) AS THE 

DEPENDENT VARIABLE 








Reward System 
Reinforcement 
Sets Varying in 
information 
1x2 .94 
Within Cells 5 29.62 (error term) 


"P< 01 
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reward systems involving the set of outcomes, 
MGL. 

Accurate prediction for the PGP and PG 
conditions obviously cannot be completely 
derived from these assumptions. The PGP 
condition results in behavior that conforms, in 
part, to the predictions. Apparently the early 
trials provide a sufficiently consistent relation- 
ship between changes in response magnitude 
and presentation of the two Poor outcomes so 
that Ss learn to “impose” an appropriate 
ordering of these two Poor outcomes, i.e., 
discriminate between them. The particular 
reward system may be such as to continue to 
reinforce this discrimination of the two Poor 
outcomes. However, ii other reward systems 
(bottom two rows) the decrement in reward 
frequency may result in the extinction of this 
implicit ordering behavior. Whether, at this 
point, the S reverses the values, i.e., reverses 
his previous discrimination, or simply lumps 
the two Poor outcomes, i.e., the discrimination 
is extinguished, or is inconsistent is difficult to 
infer from these data. In any case, however, the 
frequency of reward does not increase appre- 
ciably over time in the conditions represented 
by the bottom two rows. There is, simply, a 
lower frequency of the very large or very small 
response magnitudes that are found in the 
MGL condition. 

The validity of the predicted areas for the 
PG condition is nil. This finding is not sur- 
prising since the assumptions from which 
these areas are derived require informational 
components in the reinforcement not present 
in the PG condition. The S is given no informa- 
tion about response magnitude, and hence 
there is no reason to assume either a consistent 
increase or a consistent decrease in response 
magnitude following a Poor outcome, i.e., the 
first assumption for an MGL condition would 
be discarded. One could retain the two remain- 
ing assumptions, however, which are essentially 
that an S will tend to make a larger change in 
response magnitude (sometimes increasing it 
and sometimes decreasing it) following a Poor 
outcome than following a Good outcome and 
that the directions of response changes by the 
two Ss are additive. With these assumptions, 
it is possible to derive a number of interesting 
statements that contrast with those made for 
the other two types of conditions. First, one 
would predict that in the cooperative reward 
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systems in this study, the most stable subarea, 
if any, would be the (G, G) area. That is, if a 
response combination falls into the (P, G) 
areas, S, will make large response changes. 
Similarly, if the combination is (G, P), S, will 
make large response changes. A (P, P) outcome 
will yield large response changes by both Ss. 
The sizes of these areas relative to the (G, G) 
area should, therefore, be one determinant in 
the frequency of reward for both Ss. Hence, 
S .50-.50 might be expected to produce more 
Good outcomes than adjacent conditions since 
it has the largest (G, G) area and no (P, G) 
or (G, P) areas. It is the one condition in which 
both Ss make large response changes or neither 
does.* This seems to be the effect for at least 
two of the four dyads in this condition. By 
contrast, neither the adjacent conditions S* 
.60--60 nor S* .40-.40 reward systems pro- 
duce nearly such a high proportion of reward 
for both Ss. 

The relevance of the Z functions themselves 
in the PG condition is also derived from this 
reasoning. We have assumed that (P, G) and 
(G, P) are unstable to some extent because 
either S, or S,, respectively, makes large 
response changes. The extent to which these 
outcomes are unstable depends in turn on the 
pair of Z functions. Thus, if the outcome is 
(P, G) and the response of S, received a large 
weighting in the reinforcement of S,, it would 
be expected that S, would soon fail to receive 
positive reward. This in turn increases the 
size of response change of S,. On the other 
hand, Z functions that make one or both Ss 
relatively independent of the other would 
produce a greater number of Good outcomes 
for one or both S. This is discernible in the ex- 
treme condition St 1.00-1.00 where both Ss 
are independent of each other. Nevertheless, 
S* .50-.50, which appears to clearly divide 
optimal from nonoptimal reward systems in 
the MGL condition, is not the critical point in 
the PG condition. 

* It is interesting to note that the S* .50-.50 is the 
only reward system which satisfies some definitions of a 
cooperative situation, viz., when one receives reward, 
all do, when one fails to receive reward, no one does 
(Lindzey, 1954, p. 810). One may speculate about the 
possibility that this restrictive definition evolved be- 
cause cooperative action develops best here with 
equivocal outcomes and that the tasks typically used 
in the past for cooperative studies do contain 
more equivocal reinforcing events than those in Experi- 
ment 1. 
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It would be interesting to pretrain both Ss 
individually so that their response combina- 
tions fell into the (G, G) area and then change 
them to a cooperative situation. Because of 
the response drift, and cn the basis of this 
experiment, it is quite probable that a (G, P), 
(P, G), or (P, P) condition would soon occur 
and that return to (G, G) would then be as 
difficult for such a dyad as for one which had 
received no preliminary individual training. 

In summary, then, using a PG set of out- 
comes, one would probably predict that no 
subarea is as stable as those found in more 
informative outcomes. In general, one would 
predict that the (G, G) area is most stable, 
the (P, G) or (G, P) areas next most stable, 
and (P, P).areas least stable. The frequency of 
occurrence of responses in any of these areas 
depend upon (a) the relative sizes of these 


TRIALS 1-5 TRIALS 6-10 
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areas and (6) the degree of independence 
between persons as determined by the pair of 
Z functions. 

Quantitative assumptions are required to 
make more exact predictions about outcome 
stability as a function of these experimental 
parameters. Qualitatively, it seems surprising, 
nevertheless, that the wide variations in the Z 
functions in this study (short of complete 
autonomy) result in such an ineffective pro- 
curement of positive reinforcement for both 
Ss over all 50 trials in the PG condition. There 
is some reason to think that this poor per- 
formance is, in part, a function of the type of S 
used in this study. It was observed that in 
many dyads both members failed to demon- 
strate much exploratory behavior over the 
entire 50 trials. They failed to increase their 
response magnitudes sufficiently to get out of 
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the lower left (P, P) area. This depressed 
activity may be inherent in the nature of the 
sample of Ss. They were all persons who had 
just entered Air Force service and hence were 
in a very strange and stressful environment 
generally. It is unknown whether other, more 
relaxed Ss may have simply elevated the total 
frequency of reward over all reward systems, 
or whether the effects of different reward sys- 
tems would have been more clearly distin- 
guishable using the PG outcomes with the 
more “relaxed”’ Ss. 
EXPERIMENT 3 

The purpose of the final experiment is best 
understood by reference to Fig. 1, the graphic 
representation of the combinations of Z func- 
tions for each dyad. The results of the first 
two experiments indicate that a predicted 
subarea for the most explicit form of reinforce- 
ment stimulus, MGL, is unrewarding to one 
or both Ss when a reward system is imposed in 
which both receive as feedback less than half 
of their own response variance in the S* and 
when a, = a,. This is the lower half of the 
hypotenuse of the triangle in Fig. 1. By extrap- 
olation, it may be demonstrated that these 
two restrictions are not necessary conditions 
for predicting an absorbing area containing an 
unrewarding outcome for one S. In fact, a 
nonoptimal absorbing area will be predicted 
even though one S may receive a majority of 
his response in his reinforcement as long as 


a, + ay < 1.00 


This equation defines the shaded area in 
Fig. 1. When a, * a, the effect is to increase 
the probability of unrewarding outcomes to 
the S assigned to the smaller a value and de- 
crease the probability of unrewarding outcomes 
to the S assigned the larger a value. 

The purpose of this experiment is to demon- 
strate these predictions empirically by select- 
ing three reward systems not on the hypote- 
nuse of the triangle where 

a, + a, > 1.00 
a, + a, = 1.00 
a, + a, < 1.00 


Experimental Conditions 

Each dyad was assigned to one of the three 
degrees of S* information, i.c.. MGL, PGP, 
PG, and to one of the three reward systems 
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TABLE 2 
ANALYSIS OF VARIANCE OF THE Last 10 TRIALS OF 
EXPERIMENT 3 USING THE DIFFERENCE SCORES 
(FREQUENCY OF PREDICTED AREAS MINUS 
FREQUENCY OF OTHER AREAS) AS THE 
DEPENDENT VARIABLE 
df Mean Square F 
85.77 4.92* 


Reward System 2 
2 15.60*** 


Reinforcement 
Sets Varying in 
information 

5.47** 

(error term) 


1X2 
Within Cells 


*P < .05. 
"P< 01. 
see P < 001. 


shown in Fig. 1 by the number 3 in a circle. 
This made a 3 X 3 design with four dyads in 
each cell. 

The sum of values for a, and a, were 1.50, 
1.00, and .50. The @ values were, respectively, 
(a) .50, 1.00; (6) .00, 1.00; (c) .00, 50. 


Results and Discussion 

Figure 4 is a graphic summary of the results. 
The same notation was used here as in previous 
figures. 

Qualitative inspection supports the validity 
of predictions for the MGL condition. The 
two other types of S* information show less 
tendency to produce response pairs in the pre- 
dicted areas, and again, the PG condition is 
least predictive. Note also that the PG condi- 
tion is in general somewhat less sensitive to the 
different reward systems than are the other 
two informational conditions. 

An analysis of variance of the results for the 
last 10 trials is summarized in Table 2 and 
corroborates the significant differences among 
the three reinforcement types. Significant 
effects attributable to different reward systems 
and to interaction were also obtained. Figure 4 
suggests the nature of these effects. The PGP 
and, to some extent, the PG conditions show 
better predictability when (G, G) is the pre- 
dicted outcome than when (P, G) or (G, P) 
is predicted. This result is in line with the 
discussion of these conditions in Experiment 2. 


GENERAL DISCUSSION 


There appears to be a growing interest in 
the simplification and increased control of 
laboratory situations employed to study inter- 
personal phenomena. This trend stems, in 
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part, from the analyses of interpersonal be- 
havior based upon an S-R system. (Lindzey, 
1954, pp. 77-87; Logan, 1955; Miller & 
Dollard, 1941; Sears, 1951.) These analyses 
usually include concepts and principles of 
learning, and the empirical studies almost 
invariably employ laboratory situations that 
are highly controlled with regard to the stimu- 
lus and response elements of an interpersonal 
event (e.g., Miller & Dollard, 1941; Rosenberg 
& Hall, 1958; Rosenberg, 1959; Sidowski, 
Wyckoff, & Tabory, 1956). Some writers, 
whether they apply an S-R system or not, 
employ these highly controlled situations for 
the application and development of rather 
formal theoretical systems, e.g., mathematical 
models of interpersonal events (Atkinson & 
Suppes, 1958; Estes, 1957; Hays & Bush, 
1954). 

The present study is concerned with the 
problem of organizing a set of interpersonal 
variables on a descriptive S-R level and in 
refining experimental techniques rather than 
in quantitative theory construction. The 


descriptive approach, however, is not aimed 
at the proliferation of experimental variables 


without regard to some parsimonious grouping 
of experimental operations. The present study, 
for example, would appear to provide general- 
ization for a number of Z functions in addition 
to the simple additive ones used here. It 
would appear to matter very little, for example, 
whether the functions for S, and/or S, were 
curvilinear rather than linear. If one could 
determine unequivocably that absorbing 
area(s) existed for a pair of Z functions (and 
this is possible for a variety of nonlinear func- 
tions), it would matter little that these func- 
tions were nonlinear. 

There are combinations of functions, how- 
ever, for which no absorbing area(s) can be 
predicted from the assumptions in this paper. 
For example, if one used the function Z = 
aR, — 6bR, for one S and Z = a’R, + WR, 
for the second S, one would be able to predict 
from the assumptions used in this study only 
that R, and R, would change continually 
from trial to trial so as to “orbit” around the 
(G, G) area. There is no way of predicting 
from present assumptions whether the orb.t 
would become smaller and hence converge on 
the (G, G) area, become larger, change shape, 
etc. Experimental results using this interesting 
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pair of general functions would presumably 
permit some generalization to a variety of 
similar combinations of functions. It is interest- 
ing to note, parenthetically, that the Z = 
aR, — 5bR, function for one S in combination 
with a function describing individual discrimi- 
nation learning by a second S§ resembles the 
imitation paradigin of Miller and Dollard 
(1941) and that an absorbing area does exist 
for this reward system in the (G, G) area. 

Another interesting set of function pairs is 
that in which no (G, G) area exists. That is, 
the Z functions do not intersect anywhere on 
the graph. This specification would define a 
competitive task, if only one trial were per- 
mitted. Over a series of trials, however, such a 
combination of functions would fall within 
the definition of a cooperation. It would not be 
surprising, however, to find dyads unable to 
develop optimal cooperative action nor even 
to procure an optimal sum of individual reward 
frequencies although the situation permitted 
the occurrence of a rewarding outcome to one 
or the other S on each trial. Studies that com- 
bine interests in learning and in game matrices 
usually employ examples of this set of function 
pairs. 

A second class of variables subject to empiri- 
cal analysis and generally independent of the 
Z functions, are those termed “awareness”. 
While these variables have received consider- 
able prominence in social psychology (some 
social psychologists would reject a situation as 
interpersonal if Ss were not “aware” of social 
linkages), there has been little systematic 
attempt to formulate a parametric approach 
for this class of operations. When “‘awareness”’ 
is manipulated by E directly, it usually takes 
on only two values—an S is or is not told 
something. (Neimark & Rosenberg, 1959; 
Sidowski, 1957) Some experimental tasks, e.g., 
face-to-face, are so constructed that extensive 
manipulation of awareness is not possible. 
Other tasks such as the one in this study permit 
the Z to assign many “values” to awareness. 
These values may range from telling each S 
nothing about the interpersonal linkage during 
the initial instructions, to a complete descrip- 
tion for both Ss of the functions by which 
their responses are to be combined. Note also 
that the task in this study permits the E the 
choice of delivering to each S rewards of which 
the other S may or may not be made aware 
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from trial to trial, and thereby extended the 
concept beyond the instructional differences. 

The implications of the present study would 
appear to extend beyond that of a knob-turn- 
ing task. Pilot replications of aspects of this 
study by the author using other trial-and-error 
tasks, e.g., time estimation, size estimation, 
suggest ready generalization of results to these 
tasks. One would also expect, for example, 
some generalization to the autokinetic re- 
sponse. While it is premature to outline im- 
plications of the present results to life situa- 
tions, it seems likely that the interpersonal 
reward systems imposed by laboratory situa- 
tions can be coordinated with systems of 
reward imposed upon a dyad by some natural 
or mechanical environments—and perhaps 
can be eventually coordinated to reward sys- 
tems imposed by the behavior repertoires of 
the interactors. 


SUMMARY 


Reinforcement parameters for a dyadic 


cooperative situation were defined and varied 
in a series of three experiments. A cooperative 
situation was defined as one where (a) there 


exist at least two outcomes for each S, (6) one 
of which is a rewarding state of affairs, (c) 
reward delivered to one person does not pre- 
clude its availability for the other, and (d) 
the outcomes to both persons are, in part or 
entirely, a function of the behavior of the 
other person. The task was the same in all 
three experiments. Each S was required to 
perform the simple motor response of turning 
a small concealed knob in a series of 50 trials. 
A reinforcing stimulus was delivered to each S 
separately after each trial. All conditions either 
satisfied the definition of a cooperative situa- 
tion or were a social (wherein reinforcement 
was contingent upon individual behavior 
alone). The degree to which optimal coopera- 
tive behavior, i.e., procurement of maximum 
reinforcement by both Ss developed in these 
situations, was studied as a dependent vari- 
able. 

Experiment 1 was composed of five coopera- 
tive reward systems. These systems varied in 
the extent to which the response magnitude of 
one S of a dyad determined the reinforcement 
stimulus for the other S and vice versa. That 
is, the reinforcement stimulus, Good, was 
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delivered to an S on trials when the Z value in 
the following equation fell within a predeter- 
mined range. 


Z= aR, + (1 — a)Ry 


where R, and R, are response magnitudes of 
the two Ss and where a is assigned a value 
between zero and one for each S. Predictions 
were made concerning the degree to which 
cooperative action would develop as a func- 
tion of these relative values of @ for each S. 
It was predicted that in three reward systems, 
where both Ss’ reinforcement stimulus was con- 
tingent upon a fraction equal to or greater 
than .50 of his own response magnitude, 
optimal cooperative behavior would develop 
quickly and would be maintained consistently. 
In the two remaining reward systems, how- 
ever, where both Ss’ reinforcement stimulus 
was contingent upon less than .50 of his own 
response magnitude, optimal cooperative be- 
havior was not predicted. The results clearly 
substantiated these predictions. 

Experiment 2 extended the range of relative 
response weightings to its full range, ie., @ 
values from zero to one were used for different 
sets of dyads. This extended range tested the 
generality of the predictive assumptions used 
in Experiment 1. Predictions were substan- 
tiated for this extended range of weightings. 
The weighting, .50, was again demonstrated 
to be the critical point, i.e., when reinforce- 
ments to both Ss were weighted by less than 
.50 of their own responses, optimal cooperative 
action failed to develop. 

Informational content concerning the size 
of Z was varied in Experiment 2. It was found 
that predictions about Ss’ behavior were less 
valid as the information to S about Z became 
more equivocal and that the total frequency 
of positive reinforcement to either 5 dropped 
considerably under these conditions of reduced 
information. 

Experiment 3 investigated further extensions 
of the response weighting parameter so as to 
demonstrate the generality of the predictive 
assumptions for all combinations of weightings. 
Experiments 1 and 2 were limited to combina- 
tions of weightings where the response of one 
S was weighted the same in his reinforcement 
stimulus as that of the second S in his rein- 
forcements. Experiment 3 demonstrated when 
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one S’s response weighting in his reinforcement 
is different from the second S’s response 
weighting, optimal cooperative action fails 
to develop where the sum of these two weights 
is less than 1.00, and hence such failures are 
not limited to reward systems where both Ss’ 
responses are weighted less than .50. 

These experiments support the conclusion 
that the combination of response weightings 
in the reinforcements to each S is an important 
parameter in the development of cooperative 
action. The amount of information concomi- 
tant to the presentation of reinforcement to 
each S also appears to play an important role 
in the development of cooperative action and 
in the frequency of reinforcement procured by 
both Ss over a series of trials. 

The status of the present study as a con- 
tribution to a descriptive system of inter- 
personal behavior was briefly discussed. Other 
experimental parameters were suggested and 
the implications of parameters for 
analysis of interpersonal situations were 


sketched. 


these 
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ACHIEVEMENT RELATED CHARACTERISTICS 
OF MANIC-DEPRESSIVES' 


JOSEPH BECKER? 
Duke University 


HE general purpose of the present study 

was to attempt a more objective and 

rigorous testing of hypotheses about 
manic-depressive character derived primarily 
from the clinical research of Cohen, Baker, 
Cohen, Fromm-Reichmann, and Weigert 
(1954).° Their descriptions of manic-depressive 
character were reformulated in terms of 
personality types which have emerged from 
experimental investigations of human moti- 
vation. The Achievement concepts of 
McClelland and his associates (McClelland, 
Atkinson, Clark, & Lowell, 1953; De Charms, 
Morrison, Reitman, & McClelland, 1955) 


provided a particularly appropriate framework 
for examining certain aspects of manic-de- 
pressive character. The formulations of Cohen 
et al. attached considerable significance to 
behaviors and attitudes related to achievement 
and conformity, which are also of central 


importance to McClelland’s concepts. 
According to these writers (Cohen et al., 
1954, pp. 113-119), early parental indulgence 
was abruptly supplanted by approval con- 
ditioned upon the fulfillment of parentai 
demands for rigidly conforming behavior and 
high achievement. This change in the nature 
of parental demands resulted in rigid, im- 
personal standards being foisted upon the 
premanic-depressive child by his parents in 
order to use him as an instrument for raising 
the family’s social prestige. These standards of 
achievement and conformity were determined 


1 This paper is based upon a doctoral dissertation 
submitted to Duke University. The writer wishes to 
express his appreciation to Charles D. Spielberger, the 
dissertation chairman, and Joseph B. Parker, Chief of 
Psychiatry VA Hospital, Durham, North Carolina, 
for their generous assistance. 

? Now at Umstead State Hospital, Butner, North 
Carolina. 

* This investigation was based on a research project 
which studied 12 manic-depressives who were being 
treated by intensive psychoanalytic therapy. The 
research group was comprised of staff members of the 
Washington School of Psychiatry who met in bi- 
monthly, three-hour seminar sessions for approxi- 
mately four years. 


by the parents’ (usually the mother’s) con- 
ception of what the neighburs would think 
proper and appeared to be severely enforced. 

In adulthood (Cohen et al., 1954, pp. 119- 
121), the manic-depressive was caught in the 
dilemma of strongly inculcated achievement 
values based on the need to placate others, and 
and fear of alienating others through his 
success. Achievement was valued primarily as 
a means toward the end of gaining others’ 
approval. Excellence of performance or inde- 
pendent activity as satisfying in itself was 
subordinated to the belief that success was 
attainable only through manipulating others, 
not through one’s own efforts. So excessive is 
the dependence of the manic-depressive on the 
approval of others that he will abandon even 
his rigidly conventional wpinions in favor of 
the perceived values of the needed other, 
particularly if the other is seen as an authority 
figure. 

McClelland and his associates identified two 
relatively independent types of achievement 
orientation. Need Achievement is said to 
characterize persons who are concerned with 
living up to an internalized standard of 
excellence. Their satisfaction is in doing a job 
well rather than in the enjoyment of the end 
product. As children, high need Achievers were 
subjected to more early independence demands 
and had fewer restrictions imposed upon them 
than did low need Achievers (Winterbottom, 
1953). In contrast, value Achievement charac- 
terizes individuals who have responded to 
excessive parental verbal emphasis on 
achievement striving, by a conscious valuing of 
achievement. DeCharms et al. (1955) in 
speculating about the origins of value 
Achievement state that persons “who describe 
themselves as ambitious and achievant may do 
so for defensive reasons; they have perhaps 
been under some authoritarian pressure from 
their parents to be ambitious and the resultant 
motive which has originated in external sources 
shows itself primarily as a fear of being un- 
successful” (p. 419). Whereas high need 
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Achievement has been shown to be posi- 
tively related to better performance on tasks 
involving “internalized standards of excel- 
lence” and negatively related to conformity 
and authoritarianism (Brown, 1953; 
McClelland et al., 1953; Wendt, 1955), high 
value Achievement is unrelated to performance 
involving internalized standards of excellence 
and is positively related to conformity behavior 
and authoritarian attitudes (DeCharms et al., 
1955). 

According to the conceptualization of Cohen 
et al. (1954), manic-depressives and high value 
Achievers seem to share a common defensive 
reaction pattern consisting of outwardly 
adopting the prevailing values of others in 
order to placate and win needed approval 
from them. Dependency on others for guidance 
and approval is manifested in the behavior of 
both manic-depressives and high value 
Achievers through conventional attitudes, 
stereotyped social interactions, and con- 
formity. However, Cohen et al. have also 
asserted that premanic-depressive children 
frequently assumed early responsibility, and as 
adults often became financially successful. 
Such behavior would not be inconsistent with 
what might be expected of McClelland’s high 
need Achievers. Furthermore, Arieti (1957) 
has implied that manic-depressives are like 
Riesman’s inner-directed character type 
(Riesman, Glazer, & Denny, 1950), a type 
with which McClelland (1955) has also identi- 
fied the high need Achiever. 

Since phenotypical characteristics and the 
socialization process of manic-depressives 
seemed to resemble those of high value 
Achievers more closely than those of high need 
Achievers, the hypotheses of this study pre- 
dicted that manic-depressives would perform 
like high value Achievers, as compared to a 
nonpsychiatric control group, on a number of 
tasks which differentiate need and value 
Achievement performance. 

The following predictions were made: 


Hypothesis 1: Manic-depressives describe themselves 
as more ambitious and achievant than the controls. 

Hypothesis 2: Manic-depressives subscribe more 
strongly to authoritarian ideology than the controls. 
Both high authoritarians and manic-depressives have 
been reported to share such characteristics as intoler- 
ance of ambiguity, social imperceptiveness, stereotypy 
and rigidity in behavior and attitudes, and strong 
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conformity tendancies (Titus & Hollander, 1957; 
Cohen et al., 1954). 

Hypothesis 3: Manic-depressives profess a more 
traditional ideology with regard to family roles and 
statuses than the controls. 

Hypothesis 4: Manic-depressives are not significantly 
different from the controls in strength of need Achieve- 
ment phantasy. 

Hypothesis 5: Manic-depressives are not significantly 
different from the controls in their achievement on 
performance tasks under either task or achievement 
oriented conditions. 

If, however, the manic-depressives more closely 
resemble high need Achievers, then on the operational 
measures: (4) the manic-depressives would not be 
expected to differ from the controls on the value 
Achievement scale; (6) the manic-depressives would be 
expected to score lower on the California Fascism 
scale, and the Traditional Family Ideology Scale; and 
(c) the manic-depressives would be expected to score 
higher in need Achievement phantasy and the per- 
formance tasks. 


METHOD 

Subjects 

The subjects (Ss) were 24 remitted manic-depressives 
and 30 nonpsychiatric controls. All Ss were male white 
veterans, aged 25 to 65, with at least sixth-grade 
reading ability; most were born and reared in the 
Southeastern United States. Remitted manic-depres- 
sives were used in this investigation in order to study 
manic-depressive character undistorted by the con- 
founding effects of severe affective disturbance and 
treatment regimens. These cases were carefully selected 
on the basis of Kallman’s (1953) definition of manic- 
depressives as “cyclic cases which show periodicity of 
acute, self-limited mood swings before the fifth decade 
of life and no progressive or residual personality dis- 
integration’ before or after such episodes” ( p. 130), 
the absence of Lewis and Piotrowski signs (Lewis & 
Piotrowski, 1954), and judgments of clinical remission 
discussed in detail elsewhere (Becker, 1958). All of the 
manic-depressive patients had been at the Durham 
VA Hospital some time during the past five years. 


TABLE 1 


COMPARISON OF MANIC-DEPRESSIVES AND 
CONTROLS ON MATCHING VARIABLES 





| Manic-Depres- 


sives (N 24) Controls (N 30) 


” 





| Me- | SD M 


|Mean SD 


e- | 
| dian | dian |Mean 


——s | 

Age | 41.50 44.13)10.56 38.50) 43.17/12. 10 

Education | 12.60) 12.13] 2.91 | 11.50! 12.17) 3.66 

Ohio | 36.50| 36.50) 5.00 | 37.67| 36.50) 5.98 
| 


Literacy | | | | 


-78 
77 
-70 





382.5 











* A probability value of p > .05 (two-tailed tests) was set as 
the level of significance required for rejection of the null hy- 
pothesis in all of the statistical procedures of this study. 
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The control group consisted of 30 Ss who were free 
of psychosomatic symptoms, chronic physical disorder, 
and psychiatric illness. The control Ss were comparable 
to the manic-depressives for age, education, and reading 
ability. The medians, means, and standard deviations 
of the experimental and control groups on the equated 
variables are presented in Table 1. 

Since the data for age, education, and literacy were 
either markedly skewed or rectangularly distributed, 
the Mann-Whitney U Test was used to determine 
whether the groups had been drawn from the same 
population with regard to the control variables. There 
were no. significant differences between manic- 
depressives and controls on any of these variables. 


Procedure 


Nine experimental tasks were administered to each 
S in an individual testing session which required 
approximately 90 min. The following orienting in 
structions were given in a conversational manner: 

The purpose of the tasks you’re about to take is 
to allow evaluation of some important attitudes and 

behaviors that seem to have a lot to do with how a 

person gets along outside the hospital. We want to 

compare people who have had some emotional 

difficulties, but recovered or improved, with others 

who have never been hospitalized for this reason. 
The intent of these instructions was to stimulate 
interest in the tasks without unduly arousing an 
achievement or affiliative orientation. 

The experimental procedures were introduced by 
asking the Ss to read the instructions for the Hildreth 
Mood Scale (Hildreth, 1946) and to fill it out. They 
were then administered a four-card need Achievement 
measure (“Inventors,” “Boy in Checkered Shirt,” 
“Trapeze Artists,” and “Architects’”) which deviated 
from the standard procedure in that Ss were tested 
individually rather than in a group and the usual 5-min. 
time limitation per story was not imposed. Most Ss, 
however, finished the individual stories within a 5-min. 
period. 

After completing the need Achievement measure, 
the Ss were given the Ohio Literacy Test (Foster & 
Goddard, 1924 

Next, the three attitude scales—value Achievement 
scale (DeCharms et al., 1955), F scale (Adorno, Fren- 
kel-Brunswik, Levinsor, & Sanford, 1950), and TFI 
Scale—(Levinson & Huffman, 1955), were administered 
together. Since instructions for each of these scales 
are similar, only one set of instructions was given. 
After the attitude scales were completed, the IPIT 
was given.‘ 

The two performance measures were the final tasks 
administered. The Xs-in-Circles task (Atkinson & 
Reitman, 1956) was given with task oriented in- 
structions, followed by the Additions task (Williams, 
1955). Achievement oriented instructions were used 
with the latter task, stressing speed, accuracy, and 
competition. 


‘ Because of the difficulty in generating hypotheses 
within a need or value Achievement framework, data 
for this measure are not reported in this study. 
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The finding that the manic-depressives did 
not differ significantly in affective state and 
that as a group they were not more affectively 
variable at the time of testing than the controls 
supports the clinical judgment that the manic- 
depressive Ss were in remission. This finding 
also supports the contention that any obtained 
differences on the experimental measures are 
not primarily due to differences in momentary 
affective state. 

The medians, means, and standard devi- 
ations of the experimental measures are 
reported in Table 3. The differences between 
the manic-depressives and the controls in the 
experimental measures were evaluated by / 
tests when the data were approximately 
normally distributed and by nonparametric 
Mann-Whitney U tests when the data departed 
markedly from a normal distribution. The 
findings relative to the hypothesized perform- 
ance of manic-depressives as value Achievers 
were as follows: 


Hypothesis 1: The scores for the manic-depressives 
on the value Achievement scale were significantly 
(U = 214, » < .02) higher than those of the controls. 
This finding confirms Hypothesis 1; the manic-de- 
pressives placed a strong, positive, conscious valuation 
on achievement. 

Hypothesis 2: Manic-depressives obtained a sig- 
nificantly (¢ = 2.05, » < .05) higher mean score than 
controls on the F scale. This finding confirms Hypoth- 
esis 2; the manic-depressives performed in a manner 
similar to that of high value Achievers on the F scale. 
High scorers on this scale tend to be characterized by 
rigidly conventional authoritarian attitudes, con- 
formity, intolerance of ambiguity, and social imper 
ceptiveness. 

Hypothesis 3: The mean score of the nianic-de- 
pressives on the TFI scale was significantly (¢ = 2.74, 
p < .01) higher than was that of the controls. This 
confirms Hypothesis 3; the manic-depressives per- 
formed in a manner similar to that which would be 
expected of high value Achievers. High scorers on the 
TFI scale are said to be characterized by attitudes of 
submission to authority, extreme emphasis on disci- 
pline, and rigidly defined roles and statuses for family 
members. 

Hypothesis 4: The distribution of scores for the 
manic-depressives on the need Achievement measure 
was not different from the distribution of the controls 
(U = 368). The fact that manic-depressives do not 
score as would be expected if they were high need 
Achievers is not inconsistent with their being high 
value Achievers and supports Hypothesis 4. 

Hypothesis 5: The scores of the manic-depressives 
were not different from those of the controls on either 
the Xs-in-Circles Task (U = 372) or the Additions 
Task (U = 309). The manic-depressives did not 
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perform more rapidly under either the “task” or 
“achievement” oriented instructions. If the manic- 
depressives were high need Achievers, they would be 
expected to score higher on either one or both of these 
tasks. The finding that manic-depressives did not 
score higher is not inconsistent with their performing 
like high value achievers and supports Hypothesis 5. 


RESULTS 

The Hildreth scale served the twofold 
purpose of checking the reliability of the 
psychiatric judgment of clinical remission and 
affording a basis for evaluating the effect of 
mood state on the experimental measures. The 
means and variances of the mood scale scores 
for the manic-depressives and controls are 
presented in Table 2. A ¢ test of the difference 
between the means was not significant, nor 
was an F test (df 23, 29) of the differences 
between the variances. 

In general, the manic-depressives performed 
in a manner which would be consistent with 
their being high value Achievers on all of the 
tasks administered. On none of the measures 
did the manic-depressives perform as would 
be expected of high need Achievers. 


Effects of Age and Educational Level 

Age and educational level have been reported 
to affect several of the experimental measures 
used in this investigation. The effects of these 
variables were partially controlled by re- 
stricting the age range (25-65) and by re- 
quiring a sixth-grade reading ability. In ad- 
dition, the effects of age and education within 
the range of subject variability were examined. 
Mean scores on each of the experimental 
measures for manic-depressives and for con- 
trols above their respective median ages were 
compared with mean scores for those below 
the median ages; similarly mean scores for Ss 
who exceeded the median educational level 
were compared with scores for Ss who were 


TABLE 2 
COMPARISON OF MANIC-DEPRESSIVES AND 
CONTROLS FOR SUBJECTIVE Moop 
STATE 


Manic Depres- 


sive (N 24) Control (N 30) 


Statistic 


23.82 
11.36 
18.3-30.3 


Mean 
Variance 
Range 


TABLE 3 
COMPARISON OF MANIC-DEPRESSIVES AND 
ConTROLs ON Srx EXPERIMENTAL 
MEASURES 








Manic —_— Controls (N 30) 


Measures 





M » Me- 
di : dian x 


| 
| 
| 
| 


v Achievement . 4.86) 1.00 
F scale ‘ 4.57 .87 
TFI scale 29} 4.19] .73) 3.76] 3.67 <.01 
n Achievement | si 5.38) 3.98} 5.00) 5.77) 4.68 .88 
Xs-in-Circles |562.00)571. 17|122. 28|560. 00)573.83)114.92 .83 
Additions* |S0s.00 534 58|230. 17|388. 50/497. 13}220. 37 


«2 4.16 


4.17) 4.09 <.05 











"Scores are in total seconds to complete the 200 addition 
problems. 


below the median educational level (Becker, 
1958). 

Value Achievement scores increased signifi- 
cantly with age for the manic-depressives 
(p = <.05) but not for the controls. Whereas 
level of education was inversely related to 
scores on the F scale (p = <.05), and TFI 
(p = <.05) for the controls, no significant 
relation was obtained for the manic-de- 
pressives. 

In order to determine whether the controls 
were representative of nonpsychiatric popu- 
lations generally, their performance was com- 
pared with normative data for comparable age 
and educational levels. No significant differ- 
ences were found (Becker, 1958). 


DISCUSSION 


Manic-depressives in this study performed 
like high value Achievers with respect to 
achievement striving, attitudes toward au- 
thority, and conventional beliefs. In the manic- 
depressives as in the high value Achievers, 
achievement and conformity behavior seem to 
have the goal of gaining the approval of others 
rather than of satisfying internalized stand- 
ards. The defensive conforming nature of the 
manic-depressive’s achievement strivings, and 
the rigidity of his highly stereotyped, con- 
ventional attitudes are further supported by 
the effects of age and educational level on these 
variables. Whereas achievement values were 
independent of age for the control Ss, the 
strong conscious valuation of achievement for 
manic-depressives increased with age. Pros- 
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pects of diminished capacity with increasing 
age in those excessively dependent on external 
sources for self-esteem may cause the manic- 
depressive to strengthen his achievement val- 
ues since he perceives them as a means of gain- 
ing needed approval. With respect to rigidity of 
attitudes, the manic-depressives’ high au- 
thoritarian attitudes as measured by the F 
scale and TFI are unmodified by education. 
In the general population, authoritarian atti- 
tudes decrease as a function of educational 
level. 

The results of the present study, in addition 
to indicating that the manic-depressive per- 
forms as a high value Achiever, served to 
verify empirically findings of Cohen et al., 
concerning a number of manic-depressive 
character traits. The reformulation of psycho- 
analytically derived descriptions of manic- 
depressive character within the framework of 
McClelland’s experimentally derived concepts 
of achievement striving provided for a more 
objective and rigorous evaluation of these 
traits. The generality of the findings of Cohen 
et al., were extended by the present results to 
a somewhat different subculture of predomi- 
nately white Southern Protestants. Further- 
more, empirical verification of a number of 
behavior traits attributed to manic-depressives 
lends inferential support to their etiological 
formulation. 

The findings of relatively specific adult 
achievement and conformity traits in manic- 
depressives suggests that hereditary-consti- 
tutional factors, are probably not sufficient in 
themselves to account for manic-depressive 
psychosis. These findings are, to a certain 
extent, compatible with speculations by tra- 
ditional psychoanalytic writers (Fenichel, 
1945) about the etiology of manic-depressive 
psychosis in terms of oral character structure. 
The conformity behavior of manic-depressives, 
which obtains the approval of others for them, 
is consistent with the oral character’s intense 
dependency on others. Oral character expla- 
nations of manic-depressive psychosis, how- 
ever, typically neglect the specification of 
socio-environmental factors. The findings of 
the present study support those of several 
recent inquiries (Finley & Wilson, 1951; 
Gibson, 1958; Wilson, 1951) which are in 
accord with the stress by Cohen and her 


collaborators on the importance of inter- 
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personal experience in the etiology of manic- 
depressive character. 

The results of the present study indicated 
that McClelland’s Achievement concepts pro- 
vided a useful framework within which to 
study pathological character. The utility of 
the value Achievement concept, however, was 
limited by its relatively undeveloped status. 
Greater research leverage could be achieved 
by translating manic-depressive character into 
a personality type whose characteristics, corre- 
lates, and genesis have been more compre- 
hensively studied. The concept of the au- 
thoritarian personality, studied by a wide 
range of psychological techniques (Adorno, 
Frenkel-Brunswik, Levinson, & Sanford 1950; 
Titus & Hollander, 1957), seems to offer con- 
siderable research potential in this respect. 
In the present study, maanic-depressives scored 
high on measures of authoritarianism and 
conventional attitudes. This finding suggests 
that such correlates of high authoritarianism 
as intolerance of ambiguity, social imper- 
ceptiveness, and various types of rigidity may 
be productively explored with manic-depres- 
sives. 

SUMMARY 

The present study investigated the 
achievement and conformity behavior of 
manic-depressives within the framework of 
McClelland’s concepts of achievement striving. 
On the basis of similarities between 
McClelland’s high value Achievers and manic- 
depressives as described by Cohen et al., 
specific hypotheses about the performance of 
manic-depressives on tasks relevant to value 
Achievement were formulated. Other tasks 
were included to test the possibility that the 
achievement strivings of manic-depressives 
may be similar to those of the high need 
Achiever. Twenty-four carefully selected 
manic-depressives were matched with thirty 
nonpsychiatric controls for age, education, and 
literacy level. All of the subjects were male 
white veterans between the ages of 25 and 65, 
with a minimum of sixth-grade reading ability; 
most were born and reared in the South. Nine 
experimental measures were used. 

The manic-depressives scored significantly 
higher than the controls on value Achievement, 
the F scale, and the TFI scale. There were no 
differences in the scores of manic-depressives 
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and controls on a need Achievement measure 
or on either of two performance tasks. Thus, 
the manic-depressives performed as high value 
Achievers in that they showed a strong 
conscious valuation of achievement and highly 
authoritarian and conventional attitudes. The 
effects of age and educational level on these 
measures added further support to the per- 
formance of manic-depressives as value 


Achievers. On none of the measures did the 
perform as 


manic-depressives high need 


Achievers. 
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THE INTERPRETATION OF OPINION STATEMENTS AS A 
FUNCTION OF RECIPIENT ATTITUDE' 
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LTHOUGH there is general agreement 
that a person’s attitudes influence his 
interpretation of opinion statements, 

the direction and magnitude of this influence 
is as yet unclear. One theory, which has re- 
ceived perhaps its clearest statement from 
Festinger (1957), leads to the prediction that 
in interpreting statements of opinion, the 
recipient minimizes the apparent disparity be- 
tween his own views and the views of the com- 
municator. Presumably, so doing reduces dis- 
sonance and the tension that it commonly 
produces. Message distortion of this type has 
been reported by a number of investigators 
(Allport & Postman, 1952; Cooper & Jahoda, 
1947; Edwards, 1941; Prothro, 1955). 

The views of Hovland, Harvey, and Sherif 
(1957) and their associates stand in marked 
contrast to Festinger’s (1957) position, al- 
though to the best of the writer’s knowledge 
there has been scant recognition of this fact 
and a critical comparison of the two theories 
has not been attempted. Instead of focusing 
attention on the motivational consequences 
of communicator-recipient disagreement, the 
Hovland-Sherif group rely heavily on the re- 
sults obtained in psycho-physical experiments. 
They cite a number of studies which show that 
in many judgmental tasks, introducing anchors 
(standards) which are at the end of the series 
of stimuli to be judged, or slightly removed 
from the end, results in a displacement of 
the scale of judgment foward the anchor (the 
so-called “assimilation effect’). When the 
reference point is too remote, however, the 
displacement is away from the anchor (the 
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“contrast effect’). Extrapolating from these 
findings, Hovland, Harvey, and Sherif (1957) 
are led to the possibility that “when Ss have 
established attitudes and are personally in- 
volved in a controversial social issue, their 
‘own stand’ functions as the major anchorage 
affecting reaction to and evaluation of commu- 
nication. In this case, communication near S’s 
own stand would be assimilated to it, while 
communication at variance with S’s own stand 
would be displaced still further away (con- 
trast effect). Whether assimilation or contrast 
effects appear would be a function of the rela- 
tive distance between 5S’s own stand and the 
position of communication.” These investiga- 
tors further speculate that contrast effects 
should begin to appear when the message ad- 
vocates a position that is unacceptable to the 
recipient; if the message advocates a position 
that is within S’s “latitude of acceptance,” 
or slightly beyond this range, assimilation 
effects are predicted. 

Prothro’s results (1957) are in general accord 
with these principles, as are the findings of 
Hovland, Harvey, and Sherii (1957), who 
originally proposed the theory; in the latter 
study, however, the data obtained from Ss 
who were expected to show the assimilation 
effect offered only weak support for the theory. 
Other studies (Hovland & Sherif, 1952; Sherif 
& Hovland, 1953) have reported contrast 
effects, without revealing any evidence of 
assimilation under conditions of minimal con- 
flict between the views of communicator and 
recipient. 

In addition to their work relating attitudes 
to message interpretation, Sherif and Hovland 
(1953) have shown that attitudes can affect 
the extension or constriction of the scale that 
the individual uses in judging opinion state- 
ments. These investigators asked white and 
Negro Ss, who were chosen because of their 
conflicting social attitudes, to sort state- 
ments about the Negro into as many categories 
as they considered necessary; Negro Ss used 
fewer categories than did the whites. The re- 
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sults were interpreted as indicating that “‘in- 
tensely involved individuals with an extreme 
position are more likely to adopt a constricted 
scale (with few categories).” 

The present experiment represents an at- 
tempt to obtain additional data pertaining to 
the effect of recipient attitude on (a@) message 
interpretation and (0) scale constriction. 

METHOD 
Messages 

A class of 35 undergraduate students wrote anony- 
mous statements explaining their views on college 
fraternities. From this pool, four profraternity, four 
antifraternity, and four relatively neutral messages 
were selected for inclusion in the experiment. The 
primary criterion in selecting the messages was the 
authors’ ratings of the concept college fraternilies on 
the following evaluative scales: kind-cruel, valuable— 
worthless, honest-dishonest, fair-unfair, nice~awful, 
and good-bad. The six scales were presented graphically 
in a seven-interval format and were arbitrarily rotated 
to reduce response set, i.e., the ““good’’ pole was on the 
left for half the scales and on the right for the other 
half. The obtained ratings were summed across scales, 
after a value of 1 had been assigned to the “good” end 
and 7 to the “bad” end of each scale. Previous research 
by Osgood and his collaborators (Osgood, Suci, & 
Tannenbaum, 1957) has shown that attitude scales of 
this type have satisfactory reliability and correlate 
highly with the more conventional Thurstone and 
Guttman scales. The authors’ ratings were obtained 
immediately after their messages had been completed. 


Recipients 


The experimental messages were presented to 70 
undergraduate psychology students. Before reading 
the messages, the recipients’ attitudes toward fraterni- 
ties were assessed by means of the rating scale proce- 
dure. They then read the experimental passages and 
attempted to predict each writer’s ratings of college 
fraternities, again using the six evaluative rating scales; 
these predictions were summed across scales to yield a 
measure of each communicator’s judged position on 
the attitude continuum. The passages were assembled 
in a constant, chauce-determined order, with the 
restriction that no two messages expressing a similar 
viewpoint could appear in succession. 

In order to compare the judgments of Ss who held 
varying attitudes, the total sample of recipients was 
divided into quintiles on the basis of their attitude 
scores; there were 14 Ss in each quintile. 


RESULTS AND DISCUSSION 


Figure 1 presents the mean responses of the 
five recipient-groups to the profraternity, neu- 
tral, and antifraternity messages; these re- 
sponses represent the recipients’ predictions 
as to the views held by the writers. The re- 
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Fic. 1. Mean predictions as a function of attitude. 


sults for each type of message are shown sepa- 
rately, with the recipient-groups arranged 
from left to right, in order of decreasing favor- 
ability toward fraternities; predictions that are 
high on the ordinate reflect a belief that the 
communicator disliked fraternitics. An analysis 
of variance revealed that in responding to the 
antifraternity passages, the recipient groups 
differed significantly (¢ < .05); for the pro- 
fraternity and the neutral messages, the group 
differences fell short of significance (p < .10).? 

Interestingly enough, the responses to the 
“pro” and the “anti” messages show curvi- 
linear trends that are essentially mirror images 


2 With 4 and 65 degrees of freedom, an F of 2.51 
was needed for significance at the .05 level. For the 
antifraternity messages, F equalled 2.79; for the 
profraternity messages, F equalled 2.49; for the neutral 
messages, F equalled 2.29. 

Application of the analysis of variance for trends 
(Grant, 1956; Grandage, 1958) revealed significant 
quadratic components (p < .01) for the profraternity 
and antifraternity messages and a significant linear 
component (p < .01) for the neutral messages. 
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of one another. In both cases, the more strongly 
committed the S, the greater his tendency to 
regard the communicators as holding relatively 
extrerne views. While these results seem at 
variance with Festinger’s position, they can 
be interpreted in terms of the Hovland, 
Harvey, and Sherif model, if we assume that 
each S’s latitude of acceptance included at 
least two types of messages. Specifically, it is 
assumed (a) that a pro S would find both pro 
and neutral messages within his latitude of 
acceptance, (b) that an anti S would regard 
anti and neutral messages as acceptable, and 
(c) that a neutral S would regard messages on 
both sides of his own position, i.e., both pro 
and antimessages, as being within his latitude 
of acceptance. Starting with these assumptions, 
and focusing attention on the extremely pro, 
the extremely anti, and the neutral Ss, the data 
obtained from the profraternity messages can 
be interpreted as follows: the pro Ss appear to 
have displaced the communicators foward the 
end of the scale that they themselves favored 
(the assimilation effect); that is, the mean pre- 
diction of these Ss was more profraternity 
than the mean prediction of all Ss combined. 
Since profraternity messages were within the 
pro’s latitude of acceptance, a shift of this 
sort would be consistent with the Hovland, 
Harvey, and Sherif theory. The anti Ss seem 
to have displaced the procommunicators away 
from their own anti stands (the contrast 
effect),* a result which is also consistent with 
the theory, since profraternity messages would 
doubtless be outside the anti’s latitude of 
acceptance. Finally, the neutral Ss seem to 
have shifted the procommunicators toward 
the midpoint of the scale (assimilation); this 
too is in accord with the theory, if we assume 
(as above) that neutral Ss would regard 
pro messages as acceptable. Similar reasoning 
can be applied to the data obtained from the 
antifraternity messages. 

The predictions made by the moderaiely 
pro- and moderately antifraternity Ss, in re- 
sponse to the two extreme groups of messages, 
are somewhat more difficult to interpret. These 
Ss apparently responded much as the neutrals 
did, in that their general tendency was to 

* This statement and subsequent statements regard- 
ing judgmental shifts are based on a comparison of the 


mean prediction given by a specific group of Ss (here, 
the antis), and the mean prediction of all Ss combined. 
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assimilate both pro- and antifraternity mes- 
sages toward their own positions. In order to 
integrate these data with the Hovland-Harvey- 
Sherif theory, one would have to assume that 
both extreme viewpoints were within the 
“moderates’” latitude of acceptance, or 
slightly beyond this range. 

On the neutral messages, there was a linear 
relationship between attitude and interpreta- 
tion of messages; Ss who favored fraternities 
felt that the neutral communicators were 
more profraternity than did Ss who opposed 
them. While this assimilation trend is clearly 
in accord with Festinger’s viewpoint, it is also 
consistent with the Hovland, Harvey, and 
Sherif model, if we assume (as above) that neu- 
trality was an acceptable position to all re- 
cipients. 

It is interesting to contrast the results of the 
present experiment with those reported by 
Segall (1959). This investigator had university 
students make absolute judgments of state- 
ments about fraternities and found virtually 
no differences between judges of varying atti- 
tude. He reasons that his negative results may 
have occurred because of the relatively narrow 
range of opinions represented in his Ss, and 
this may indeed be a crucial difference between 
the two studies, for the present data were col- 
lected on a big city campus, where the fra- 
ternity system was probably more contro- 
versial than it was in the setting that Segall 
describes. It is also possible that the different 
methods employed in the two studies were 
crucial in determining their conflicting results; 
that is, while Segall’s Ss were asked to judge 
the extremity of opinion statements, Ss in the 
present experiment did not judge the state- 
ments; instead, they predicted the positions 
advocated by the persons who had made the 
statements. It may be that this latter set is 
more likely to call the S’s own views into play 
as an implicit standard and determinant of 
his response. 


Altitude and Scale Constriction 


Scale constriction was measured by com- 
puting a standard deviation for each S’s pre- 
dictions over the set of 12 messages. Figure 2 
presents the mean standard deviation for each 
of the five groups of recipients. The groups 
are again arranged in order of decreasing favor- 
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Fic. 2. Variability of predictions as a function of 
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ability toward fraternities, with the pro- 
fraternity Ss on the left and the antifraternity 
Ss on the right. An analysis of variance re- 
vealed that the difference between the group 
means was significant at the .025 level. As will 
be seen, the relationship between attitude and 
scale constriction is curvilinear. Ss with rela- 
tively neutral views perceived the messages as 
reflecting a narrower range of opinion than did 
Ss who were committed on the fraternity issue; 
apparently the neutral Ss felt that there were 
only minor differences between the various 
communicators. This trend is perhaps seen 
most clearly if Fig. 2 is “folded over,” resulting 
in the formation of three recipient-groups: the 
highly committed Ss, the moderate Ss, and the 
neutral Ss. The mean standard deviations for 
these groups are 11.9, 10.7, and 9.6 re- 
spectively. 

It is likely that these constriction differ- 
ences represent a “byproduct” of the displace- 
ment trends that were discussed in connection 
with Fig. 1. That is, the neutral and moderate 
Ss may show a more constricted scale of judg- 
ment than the committed Ss because they 
tended to displace the communicators /oward 
the midpoint of the scale, while the committed 
Ss displaced the communicators away from the 
midpoint. 

Although the results obtained from the 
pro and the anti messages (see Fig. 1), and the 
results shown’ in Fig. 2, could not readily be 
attributed to random error, they could have 
been due to differences in response set; more 
specifically, the relatively uncommitted Ss 
may have been unwilling to use the extremes 
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of the scales, regardless of what they were 
rating. A response set of this sort could have 
caused these Ss to (a) respond in a neutral 
fashion to the attitude scales and (6) reduce 
their range of judgments when responding to 
the experimental messages. To check on this 
possibility, the five recipient groups were com- 
pared in terms of their ratings of two unrelated 
concepts—jazz and labor unions. These ratings 
had been obtained in order to familiarize the 
Ss with the use of the scales. If the constriction 
differences discussed above were, in fact, due 
to differences between the groups in their will- 
ingness to use the extremes of the scales, the 
neutrals’ ratings should have shown only slight 
deviations from the midpoint, the moderates 
should have shown greater deviations, and the 
committed Ss should have been most willing 
to use the extremes. Examination of the data 
indicated that this was mot the case; in fact, 
the neutral Ss showed a greater willingness to 
deviate from the midpoint than did either of 
the other two groups. The relationship between 
attitude and scale constriction, which is shown 
in Fig. 2, is therefore not attributable to gen- 
eralized response set differences. 

While the constricted range of the neutral 
Ss’ predictions was quite marked in the present 
study, the obtained results seem somewhat at 
variance with the interpretation that Sherif 
and Hovland (1953) offer for their data. As 
noted above, these investigators found that 
Negro Ss used a relatively constricted scale in 
judging items dealing with the Negro, and 
concluded that they did so because of their 
intense involvement with the experimental 
issue. The results of the present experiment 
seem to contradict this interpretation, if we 
make the assumption that the neutral Ss were 
probably Jess intensely involved in the fra- 
ternity issue than the committed Ss. While 
these discrepant findings may be attributable 
to differences in experimental procedure or 
topic, it may be that the Sherif and Hovland 
results were not due to their Ss’ varying de- 
grees of involvement in the experimental task; 
instead, they may reflect the operation of 
cultural, educational, or intellectual factors 
which might have caused the Negro Ss to use 
a relatively constricted scale, regardless of the 
stimulus materials that they were asked to 
judge. 
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SUMMARY 


Five groups of college students, who varied 
in their attitude toward fraternities, were 
presented with the written statements of 12 
communicators, who had been asked to explain 
their views on fraternities. There were three 
types of communicators: those who favored 
fraternities, those who opposed them, and 
those who were relatively neutral. After read- 
ing each message, the recipients were asked to 
indicate the communicator’s attitude toward 
fraternities, as defined by his response to an 
attitude scale. 

In responding to the profraternity and the 
antifraternity messages, there was a curvi- 
linear relationship between attitude and judg- 
ment: in both cases, the committed Ss per- 
ceived the communicators as occupying more 
extreme positions than did the neutral Ss. In 
responding to the neutral messages, the re- 
lationship between attitude and judgment was 
linear: Ss who favored fraternities felt that the 
neutral communicators were more profrater- 
nity than did the Ss who opposed them. These 
results were interpreted as being generally 
consonant with a theory developed by 
Hovland, Harvey, and Sherif (1957). 

A related finding was the fact that the neu- 
tral Ss used a relatively constricted scale of 
judgment; they clustered their predictions 
within narrower limits than did the committed 
Ss, apparently feeling that the messages did 
not reflect a particularly wide range of opinion. 
If we make the assumption that the neuiral 
Ss were probably /ess intensely involved in the 
fraternity issue than the committed Ss, these 
results appear to contradict Sherif and 
Hovland’s contention that intensely involved 
individuals with an extreme position are likely 
to adopt a constricted scale of judgment. 
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HERE has been a recent resurgence of 

interest among psychological researchers 

in the predictive value of a postulate 
that people tend to maintain logical con- 
sistency among their cognitions (and even 
between cognitions and more gross behavior). 
The most influential single source of this 
revived interest is probably Heider’s (1946, 
1958) “balance” theory, while Festinger’s 
(1957) theory of cognitive dissonance is 
probably the most stimulating elaboration 
to date. Most of the approaches that employ 
this postulate recognize, implicitly or explicitly, 
that the tendency towards consistency is not 
absolute. Its expression is limited by other, 
potentially conflicting forces in the person 
and by the inadequacies of his cognitive appa- 
ratus, even in the absence of conflicting forces, 
to achieve full consistency. These coordinate 
hypotheses of cognitive consistency and its 
predictable limitations have been applied 


successfully to the prediction of behavior in 


many areas. The work reported here involves 
two applications in the area of persuasion. 

The first of these applications involves 
what will be called the “Socratic method” of 
persuasion. The persuasion technique usually 
studied in the social sciences involves inducing 
a change in a person’s opinion by presenting 
him with a persuasive message from an 
external source, either through a “‘mass” one- 
way communication channel or in a face-to- 
face reciprocal communication network. The 
postulate of cognitive consistency suggests 
that persuasion could be effected by the quite 
different technique of eliciting the persuasive 
material from the person’s own cognitive 
repertory, rather than presenting it from 
outside. Specifically, this technique would 
involve asking the person his opinions on 
logically related issues, thus sensitizing him 
to any inconsistencies that exist among his 
stands on these issues and producing a change 
towards greater mutual consistency. The 
technique has an obvious affinity to the “So- 
cratic method” used in the Platonic dialogues 
(Jowett, 1937). 


The derivation of the foregoing hypothesis 
requires, besides the postulate of a tendency 
to maintain cognitive consistency, the assump- 
tions that various distorting factors (of which 
“wishful thinking” is given primary attention 
in this study) have introduced inconsistency 
among the initially elicited set of opinions, 
and that the temporally contiguous elicitation 
of these opinions brings the tendency towards 
consistency more strongly to bear upon them. 

Part of the study described below was 
designed to test the hypothesis that a person 
tends to change his opinions on logically 
related issues in the direction of greater 
mutual consistency when he is merely asked 
to state these opinions in close tezaporal 
contiguity. Several additional hypotheses 
regarding the focus of the Socratic effect, 
that is, the type of opinion most likely to be 
adjusted in case of inconsistency, were also 
tested. 

The second application of the cognitive 
consistency postulate to the area of persuasion 
in the present experiment dealt with the logical 
repercussions of persuasive communications. 
The effectiveness of a persuasive message is 
usually gauged by the amount of change in 
opinion that it induces on the issue with which 
it explicitly deals. From the postulated con- 
sistency tendency, however, it would follow 
that an induced change on the explicit issue 
tends also to produce changes on logically 
related (unmentioned) issues such as would 
tend to maintain internal consistency among 
beliefs. 

From the additional, limiting assumption 
that there is a certain amount of inertia in the 
person’s cognitive apparatus, two additional 
hypotheses were derived. It was hypothesized 
that the consistency-maintaining change ex- 
pected on remote issues is less than the amount 
logically required for complete consistency 
with the induced changes on the explicit issues. 
Further, it was hypothesized that inertia 
would also result in temporal effects, such 
that the impact of the message on the remote 
issue occurs only gradually, the opinion on the 
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remote issue continuing to change in the 
logically required direction for some time after 
receipt of the persuasive message. 

Since most of these hypotheses involve pre- 
dictions of fine graduations of change towards 
or away from a reference point of precise 
cognitive consistency, we need a quantitative 
definition of consistency. Such a definition is 
provided by a model combining formal logic 
and probability theory, the relevant aspects 
of which are summarized below. 

Consider three issues so interrelated that a 
person taking Stands @ and 5 on two of these 
issues would, if he were to be consistent, be 
logically required to take Stand c on the third. 
That is, the three stands are in a syllogistic 
relationship, with c following as a valid con- 
clusion from the conjunction of a and 9d. If 
the person’s opinions on these issues are ob- 
tained by having him indicate his adherence 
to each of the Stands, a, 5, and c, on a proba- 
listic scale, we can specify that to be com- 
pletely consistent, these probabilistically scaled 
opinions must be interrelated in the form 


p(c) = pla & b) + plk)(l — pla & d)) 
(1) 


where p(%) is his opinion of the probability 
of c on bases other than the conjunction of a 
and 6. It can also be specified that if we induce 
a change of opinion on a and/or 6 (for example, 
by a persuasive communication) while p(&) 
does not change appreciably, then the required 
change on ¢ necessary for maintaining logical 
consistency among the opinions is 


Ap(c) = Ap(a & 6) [2] 


which, assuming that a and 6 are independent 
events i.e., that p (a &b) = p(a) p(b) becomes 


Ap(c) = Ap(a)p(d) + A p(b)p(a) 
+ Ap(a)Ap(d) [3] 


where p(a) and (6) refer to the initial opin- 
ions, before the communication induced 
changes. When, in the discussion below, we 
refer to the “logically required change on the 
conclusion,” we are referring to the Ap(c) 
value specified in Equation 3. 


METHOD 


Material. The S3’ opinions on sets of logically related 
issues were measured by means of a questionnaire that 
contained 48 statements. These statements comprised 
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16 sets of three syllogistically related propositions; 
that is, two of the propositions stated stands on a 
pair of issues, agreement with which would logically 
imply agreement with the stand taken in the proposition 
on the third issue.! The three propositions making up 
any one syllogism appeared in different parts of the 
questionnaire, being interspersed with the propositions 
of other syllogisms. 

The Ss were asked to indicate in probabilistic terms 
the extent of their adherence to the stand taken in 
each proposition. For this purpose each proposition 
was foliowed by a graphic scale consisting of a five-inch 
horizontal line marked off at half-inch intervals and 
calibrated 0, 10, 20. .. 90, 100. To the left of the scale 
was written “Very improbable end,” and at the right, 
“Very probable end.” The Ss were asked to decide how 
probable they felt the truth of each statement to be, 
and to indicate this subjective probability by drawing 
a line through the scale at the appropriate point. 

In a later section of the questionnaire the proposi- 
tions were repeated, this time with a scale consisting 
of five boxes labelled from left to right “very desirable,” 
“somewhat desirable,’’. . .,” “very undesirable.” The S 
was instructed to indicate his feeling regarding the 
desirability of each proposition by putting an X in the 
appropriate box. (As explained below, these ratings of 
desirability were obtained to allow identification of 
initial inconsistencies, due to wishful thinking, among 
the probability ratings.) 

The questionnaires were administered during each 
of the three experimental sessions. Between the first 
and second administrations of the opinion question- 
naires, a series of persuasive messages were communi- 
cated to the Ss in mimeographed booklets.* There were 
16 such messages, each directed at one premise of each 
of the 16 syllogisms. Each message argued that the 
proposition in question was true, i.e., was aimed at 
increasing the S’s probabalistic estimate of the state- 
ment’s truth. These messages were “one-sided,” 
200-300 words long, employed rational appeals, and 
based arguments on factual evidence. They were mild 
in tone and attributed to a purportedly competent and 
dependable source. 

1 A complete list of all the syllogisms and persuasive 
messages used in the present study has been deposited 
with the American Documentation Institute. Order 
Document No. 6235, from ADI Auxiliary Publications 
Project, Photoduplication Service, Library of Congress, 
Washington 25, D. C., remitting in advance $1.25 for 
microfilm or $1.25 for photocopies. Make checks 
payable to Chief, Photoduplication Service, Library of 
Co 


ngress. 
One of the 16 syllogisms used was as follows. Stu- 
dents who violate any regulation that has been made 
to safeguard the lives and property of other students 
will be expelled; the regulations against smoking in the 
classrooms and corridors were made to safeguard the 
lives and property of the student; students who violate 
the regulation against smoking in the classrooms and 
corridors will be expelled. 

*The text of all 16 persuasive messages can be 
obtained in the ADI Document No. 6235, cited in 
Footnote 1. 
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Procedure. There were three 50-min. experimental 
sessions. In the first, the questionnaires were admin- 
istered to obtain “before communication” measures of 
opinions on the issues. The second session, which came 
one week after the first, was devoted to communicating 
the persuasive messages and then to a second admin- 
istration of the opinion questionnaires in order to 
measure the immediate impact of the persuasive mes- 
sages. One week later, the Ss participated in the third 
session during which the questionnaires were again 
administered (in order to ascertain the delayed effects 
of the persuasive messages). At this final session the 
Ss were also told the purpose of the experiment and the 
E pointed out the deceits that he had employed in the 
experiments and his reasons for so doing. 

Until this final “catharsis,” an attempt was made 
to conceal, or at least distract attention from, the 
persuasion aspect of the experiment and from the 
investigator’s interest in cognitive consistency. The 
study was purported to be an investigation of people’s 
ability to understand experts’ discussions of contro- 
versial topics. The persuasive messages were alleged to 
be excerpts from such discussions, and after each 
message the Ss were required to answer a series of 
comprehension questions (to emphasize the contention 
that we were primarily interested in studying the 
Ss’ understanding of the material). The opinion ques- 
tionnaires were explained as required to ascertain 
whether the issues dealt with were actually contro- 
versial in the group being studied and whether they 
continued to be controversial throughout the period of 
the experiment. 

Subjects and design. A total of 120 Ss participated 
in the experiment. One-fourth of these were high school 
seniors and the remainder, college freshman; the ratio 
of men to women was about 2:1 at both grade levels. 
To minimize the likelihood that the consistency hy- 
potheses would receive specious confirmation by the 

‘employment of atypically rationalistic Ss, these Ss 
were selected from a group with a rather low level of 
intellectual attainment. Their modal high school grade 
average was below the 30th percentile for all high 
school students in their state: Each of these students 
reported that he had never had a course in formal logic 
or in probability theory. 

In order that the Ss might have time to respond 
thoughtfully to the opinion items and to read the 
persuasive messages carefully, each S was required 
to provide data on only 8 of the 16 syllogistically 
related sets of propositions. On only four of these 
eight did he receive persuasive messages, the other four 
serving as no-communication “control” items. These 
no-communication syllogisms provided the data for 
testing the Socratic effect hypotheses. The syllogisms 
on which messages were communicated provided the 
data for testing the hypotheses concerning logical 
repercussions. Specific syllogisms were alternated 
between message and no-message conditions from S$ 
to S. 


RESULTS AND DISCUSSION 
Hypotheses Concerning the Socratic Effect 


Initial inconsistencies among the opinions. 
In order to identify initial opinion incon- 
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sistencies we employed an auxilliary hypothe- 
sis that the subjective probabilities of the 
propositions would be distorted in the direction 
of wishful thinking. Internal evidence indi- 
cates that this assumption was a valid one. 
The 16 syllogisms were partitioned into two 
subsets: the eight in which the desirability of 
the conclusions was low relative to that of the 
premises; and the eight whose conclusions 
had relatively high desirabilities. Wishful 
thinking would tend to make the subjective 
probabilities of the conclusions low relative 
to those of their (more desirable) premises in 
the first subset of syllogisms, while the subjec- 
tive probabilities of the conclusions in the 
second set should be high relative to their less 
desirable premises. This prediction is borne 
out by the subjective probability ratings ob- 
tained in the first session, as shown in Table 1. 
(To avoid confusing them with the significance 
levels, the obtained subjective probability 
ratings given in this report are the obtained 
probability ratings, multiplied by 100.) The 
mean subjectiv. probability of the conclusions 
exceeded the mean joint probability of the 
premises by only 2.20 points on the 100-point 
scale in the first eight syllogisms, while in the 
other eight syllogisms (the conclusions of 
which had relatively high desirability) the 
mean subjective probability of the conclusions 
exceeded the mean joint probability of the 
premises by 17.18 points. This difference 
between 2.20 and 17.18 points is significant 
beyond the .01 level. (All significance levels 
reported in this paper are based on analyses 
of variance and two-tailed tests.) Hence, the 
attempted manipulation of initial inconsistency 
by use of the tendency toward wishful thinking 
was successful. 

The main effect. The changes in opinion 
obtained in the conditions in which no persua- 
sive message was communicated on the given 
issues provide the test of the main Socratic 
hypothesis, that when the person’s opinions 
on related issues are elicited in close temporal 
continguity, his need for consistency is brought 
more fully to bear on them. Hence, when 
these opinions are again elicited on a later 
occasion, they will be found to have changed 
towards greater mutual consistency. In view 
of the initial distortions from consistency 
produced by wishful thinking, it would follow 
that in the no-communication conditions in the 
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TABLE 1 


Suspyect Prospasitity Scores (MULTIPLIED BY A Factor or 100) 
CONDITIONS 


IN THE No-COMMUNICATION 


| Excess of 
Conclu- | Conclu- 


Minor } gous et c 
sions | sions over 


| Premises remises 
| 
| 





Syllogisms with conclusions more de- 


sirable than premises 


Syllogisms with conclusions less desir- | ist 
able than premises 


present experiment initial questioning should 
produce an increase in the subjective proba- 
bilities of the conclusions relative to those of 
the premises in the first eight syllogisms 
(those with the relatively undesirable con- 
clusions) ; while for the second eight syllogisms, 
there would be a relative decline in the subjec- 
tive probabilities of the conclusions. 

The results bear out this prediction. As 
can be seen in Table 1, the gain in probability 
of the conclusions over that of the premises in 
the first eight syllogisms averaged 5.96 points. 
The probabilities of the conclusions of the 
second eight syllogisms, on the other hand, 
showed a loss of 1.72 points relative to the 
premises. This net difference of 7.68 points 
between means is significant at the .02 level. 
Hence, simply eliciting opinions on logically 
related issues does tend to move those opinions 
toward greater mutual consistency when sub- 
sequent elicitation occurs one week later. 

The difference obtained between the two 
sets of syllogisms as regards internal changes 
is in the direction that would be expected on 
the basis of a simple regression from the 
initially obtained difference. In order to test 
whether there is a Socratic effect over and 
above simple regression, an additional covari- 
ance analysis was carried out. The obtained 
difference in change of probability scores 
between the two sets of syllogisms was adjusted 
for the difference in initial levels by means of 
the obtained within-set correlation between 
the initial probability scores and the changes 
therein. Even when the obtained changes are 
adjusted for initial levels by this analysis of 
covariance, the difference in changes remains 
significant at the .02 level, indicative that a 
significant Socratic effect remains, even when 
the effect of regression is partialed out. 


Ist 
Changes, ist to 2nd 


Changes, ist to 2nd 


54.14 | 
1.45 | —0.79 


| 
55.41 | 30.12 | 
| 1.98 | 2.06 | 





The Socratic effect tends to be produced 
primarily by the first contiguous elicitation of 
the opinions, with only slight evidence of an 
accumulative effect from subsequent, addi- 
tional elicitations. As discussed above, there 
was a net Socratic effect, from the first to 
second elicitation of the opinions, of 7.68 
points. The corresponding net effect, from 
second to third elicitations was again in the 
predicted direction but amounted to only 2.62 
points, which falls far short of the conventional 
levels of statistical significance. 

Focus of the Socratic effect. An attempt was 
made to answer the further question of where 
this Socratic effect is focused; that is, which of 
the person’s initially inconsistent opinions are 
most likely to be changed in bringing about the 
greater consistency? The present data were 
analyzed to ascertain whether the amount of 
change was related to any of three variables: 
the logical status of the propositions, their 
initial valence, or their ordinal position on the 
questionnaire in the first session. 

As regards logical status of the proposition, 
we tested whether the major premise, minor, 
and conclusion contributed differentially to 
the change towards consistency. All of the 
syllogisms used in this study were of the 
“Barbara” type. Hence the major premise 
contained the M-P terms; the minor, the 
S-M terms; and the conclusion, the S-P terms 
(where S refers to the subject of the conclusion; 
P to its predicate; and M to the middle term). 

Logical status of the proposition did have a 
significant overall impact on amount of opinion 
change. As can be seen in Table 1, the major 
premise contributed most of the change to- 
wards increased consistency, the mean change 
being 3.50 points on the 100-point scale. The 
conclusion showed the next greatest change 
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towards consistency, a mean of 2.41 points; 
and the minor premise contributed least, in 
fact their mean change was slightly (1.71 
points) away from consistency. These differ- 
ences among the three means are significant 
at the .05 level by an overall analysis of vari- 
ance. 

The overall significance of the variance 
among the three means is due primarily to the 
greater change towards consistency (averaging 
2.95 points) of the propositions with the P 
terms (namely, the major premises and the 
conclusions) than of those without these P 
terms (namely the minor premises), which 
averaged a 1.71 point change away from con- 
sistency. The difference between the two 
classes of propositions is significant at the .03 
level. It appears to be explicable on the basis 
of initial distortion. In these “Barbara” 
syllogisms, the P terms tend to have the 
greatest emotional valence. Therefore, by the 
auxiliary hypothesis of wishful thinking, the 
propositions that include these emotionally- 
tinged terms; would have been the most dis- 
torted originally. The minor premises, without 
the P terms, are typically statements of class 
inclusion (“Socrates is a man”) and therefore 
less likely to be distorted. Hence, the greater 
change towards consistency of the other 
propositions appear to be a consequence of the 
fact that the others had contributed more of 
the original inconsistency. 

Two other propositional characteristics 
were investigated for possible relationship to 
the amount of change in opinion produced by 
the Socratic method: initial valence ard initial 
order of presentation. It was found that some- 
what more change towards consistency oc- 
curred on the highly valenced propositions— 
the 12 most and the 12 least desirable proposi- 
tions—than on the 24 moderately valenced 
ones. The direction of this outcome tends to 
reverse the “polarity” prediction (Osgood & 
Tannenbaum, 1955; Tannenbaum, 1956), but 
it does not approach the conventional level 
of statistical significance. Regarding _ initial 
order, it was found that somewhat more 
change toward consistency occurred om those 
propositions which, in the first session, ap- 
peared earlier on the questionnaire than on 
those that appeared towards the end, but that 
this difference was not statistically significant. 
The difference has been predicted on the 
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assumption that the later appearing proposi- 
tions had already, on the first administration 
of the questionnaire, been brought into closer 
agreement with the opinions given on the 
propositions that had come earlier, so that 
less change remained to be made. 

A number of other hypotheses, not tested in 
this study, regarding propositional charac- 
teristics that affect amount of change in cases 
of mutual inconsistency could be tested. 
Opinions on more familiar issues may be more 
stable than those on less familiar. Opinions on 
issues that show greater cognitive complexity 
and greater articulation with the rest of the 
person’s cognitive system (as defined, for 
example, by Zajonc’s [1954] procedures) might 
participate less in the change towards consist- 
ency than more isolated and undifferentiated 
opinions. Tannenbaum’s (1956) finding (that 
when differently valenced sources and stands 
on issues are paired, the person’s attitude on 
the issue shows less change than his attitude 
towards the source) is suggestive in this regard. 
Following Asch’s (1952) thinking that persua- 
sion involves a change of the issue being 
judged rather than change of judgement on the 
given issue, we might expect that in case of 
conflict, the greatest change would occur on 
the issues most easily redefined. Another 
hypothesis is that the relative proneness to 
readjustment of conflicting opinions is related 
to the bases for accepting the opinions. For 
exa.nple, following Kelman’s (1956) terminol- 
ogy, we might conjecture that opinions based 
on internalization might be more prone to 
change, in case of internal conflict, than those 
based on identification or on compliance. 
These latter need not be consistent with one’s 
other beliefs, because the adherence to them 
is based on extrinsic considerations. 
Hypothesis Concerning Logical Repercussions 

The results just reported regarding the 
Socratic effect hypotheses are all based on 
data in the no-communications conditions. 
The following results, which deal with the 
logical repercussions of persuasive messages 
are based on data from the conditions in 
which persuasive communications were re- 
ceived. The changes reported as due to the 
persuasive messages are computed from a base- 
line provided by the change from first to 
second session in the no-communication 
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condition. Since the results concerning the 
Socratic effect indicate certain systematic 
changes in opinions occur from first to second 
session even in the absence of persuasive 
messages, the message-induced changes must 
be computed from a control baseline that will 
eliminate those other changes and also any 
regression trends. 

Immediate postcommunication changes. The 
induced changes of opinion on the explicit 
target issues (the premises) were considerable. 
Each message argued that the stand taken in 
one of the premises of each syllogism was true 
and that, therefore, the receiver should in- 
crease his subjective probability regarding the 
truth of that proposition. As can be seen in 
Table 2, the mean induced increase on the 16 
explicit propositions immediately after the 
communication was 17.56 points on the 100- 
point scale (p < .001). 

In order to maintain cognitive consistency, 
the Ss should therefore have increased their 
subjective probabilities regarding the truth of 
the conclusions. Even though neither conclu- 
sions nor the additional premises needed for 
their derivation had been adverted to in the 
communications, a considerable immediate 
postcommunication increase was indeed pro- 
duced on these conclusions. As can be seen 
in Table 2, there was an increase (as required 
for consistency) immediately after the mes- 
sages on 14 of the 16 unmentioned conclusions, 
the mean increase being 5.96 points. A mean 
change of this magnitude is significant at the 
.01 level. Hence, the results do indicate a 
consistency tendency sufficiently strong to 
cause a persuasive message to have an impact 
on logically related issues not explicitly men- 
tioned in the communication. 

These obtained changes on the derivative 
issues are, however, less than the amount 
required for complete logical consistency as 
determined by substituting the amount of the 
obtained change on the explicit premise in 
Equation 3. The mean required increase on 
the conclusions so computed was 11.57 points, 
while the actually obtained increase was only 
5.96 (see Table 2), a difference significant be- 
yond the .05 level. Hence, persuasive mes- 
sages, while they do produce changes on 
remote, unmentioned issues, produce less than 
the amount logically required, a finding that 
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supports the hypothesis of a certain amount of 
cognitive inertia. 

Delayed posicommunication changes. A con- 
siderable amount of the immediately induced 
persuasion on the explicit target issues had 
dissipated by the end of the week following 
the communication. As can be seen in Table 2, 
almost half of the immediately induced change 
on these explicit issues had dissipated one 
week later. The mean loss of 8.43 points was 
significant at the .02 level. On the other hand, 
the 9.13 points of change remaining after one 
week was also significant (at the .01 level), 
indicating that there is still a significant 
amount of change of opinion on the explicit 
issue one week after the persuasive communi- 
cation. 

When this loss on the premises is substituted 
in Equation 3 the amount of the originally 
induced change on the unmentioned derived 
issues that should logically have dissipated 
during the succeeding week was computed as 
6.69 points (see Table 2.) The obtained loss 
during the interim, however, was only a neg- 
ligible 1.48 points, a gain of 5.20 points relative 
to the explicit premises for the week (p = .10). 
In fact, by the week after the persuasive com- 
munication, the net mean change on the con- 
clusions from their precommunication level 
was an increase of 4.47 points, which was 
almost as much as the 4.89 points logically 
required in view of the net changes on the 
premises. The trivial extent to which the 
change on conclusions fell short of the required 
amount one week after, as compared with the 
significant extent to which it fell short immedi- 
ately after the communication, gives some 
support to the hypothesis that the persuasive 
messages had a delayed-action effect on the 
derived issues. The net change on the derived 
issues did not actually increase with the pas- 
sage of time, but it fell off less sharply than 
would have been expected on the basis of the 
usual rate of temporal decay of the immedi- 
ately transmitted opinion change, suggesting 
that continued seepage of the impact on the 
explicit issue almost completely overcame the 
opposing decay effect on the derived, unmen- 
tioned issue. 

Comparison with previous studies. One 
methodological artifact in the present study, 
the use of a before-after design, may have 
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augmented the amount of the logical reper- 
cussions of the persuasive messages. In order 
to obtain each S’s precommunication opinions 
he was required to fill out, one week prior to 
receipt of the persuasive messages, an opinion 
questionnaire containing all the issues, includ- 
ing the auxilliary premises and conclusions 
not mentioned in the message itself. It is 
possible that having stated his opinions on 
these related issues during the previous week 
may have somewhat increased their saliency, 
so that they were more affected by the message 
than would have been the case had they not 
been raised the previous week. 

On the other hand, the finding of logical 
repercussions in the present experiment is 
particularly convincing in view of the fact 
that the method included a number of condi- 
tions that have been shown in previous studies 
to militate against logical repercussions. In the 
first place, not only was the conclusion itself 
unmentioned in the messages but also one of 
the two premises necessary for its derivation 
was left unmentioned. Also, the Ss employed 
were of a type, being below average in intelli- 
gence, that has been found (Cooper & Diner- 
man, 1951) to be relatively resistant to non- 
explicit messages. Furthermore, no mention 
was made during the experiment of our interest 
in consistency, and the logically related propo- 
sitions were widely interspersed with other 
propositions within the questionnaire.’ Finally 
the Ss gave their opinions on a scale with 
whose computational axioms they were un- 
familiar, so that they could hardly have deter- 
mined the extent of their inconsistencies even 
had they been interested in this matter. 

Hence, the negative indications reported in 
some previous studies for the hypothesis that 
persuasive communications have logical reper- 
cussions require comment. While more persua- 
sion has typically been found on a derived 
issue if that issue is explicitly mentioned 
rather than only implied, the data reported 
typically do show some change on the non- 
explicit issue (Hovland, Lumsdaine, & 
Sheffield, 1949; Hovland & Mandell, 1952). 
Also, the relative lack of effect on the derived 
issue is most evident in the immediate post- 
communication periods. Hovland, Lumsdaine, 
and Sheffield (1949, pp. 70-71) found that the 
“Why We Fight” films did have an impact 
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on more general attitudes after the passage of 
a greater amount of time, and have postulated 
that there may be need for a “sinking in” 
period before these derivative effects are felt. 
Similarly, Katz, Stotland, Patchen, and 
Jochen (1957) report that giving Ss case his- 
tory material regarding the roots of prejudice 
causes little immediate effect on their own 
prejudice, but after the passage of several 
weeks, the extrapolation of the intellectual 
knowledge to their opinions does occur. Fur- 
thermore, the magnitude of the logically 
required change on the derived issue is, as 
shown by Equation 3, less than that on the 
explicit issue. This logically consistent attenu- 
ation of the persuasive impact on derived 
issues due to a compounding of contingencies 
becomes more severe as the derived issue 
becomes more tenuously related to the explicit 
issue. For example, the more premises beside 
the explicit one needed for its derivation, the 
less the logically required change on the 
derived conclusion. Where the derived issues 
are as remote from those explicitly dealt with 
as was the case, for example, in the Hovland, 
Lumsdaine, and Sheffield “Why We Fight” 
studies (1949), very little change on the de- 
rived issues would be logically required even 
in a perfectly consistent S. 

However, while only a small change in 
opinion is logically required on the derived 
issue, the amount actually occurring is even 
less. People’s opinions seem to follow a “loose- 
link”’ model such that a considerable change 
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can be induced on one cognition before any 
pull is exerted on related cognitions. This 
effect, however, is at least partially corrected 
by the continued seepage of change from the 
explicit issue to the derived one as time passes 
after the receipt of the communication. 

The question of conscious awareness arises 
in connection with the psychodynamics of the 
present effect: to what extent is it necessary 
that the person be consciously aware of the 
inconsistencies among his beliefs in order that 
the effects occur? In the present study several 
procedures were employed in an attempt to 
minimize the degree of conscious awareness. 
(For example, no mention was made to the Ss 
that their consistency was being assessed; the 
related propositions were widely separated 
from one another in the questionnaire; a 
quantitative expression of opinion was required 
and the assessment of inconsistency involved 
a mathematical model unfamiliar to the Ss.) 
Hence, the present findings indicate that the 
effects occur even under conditions that 
attempt to avoid any suggestion of incon- 
sistency. 

The Ss’ level of awareness of inconsistency 
was not systematically varied within the 
present experiment, but it is interesting to 
speculate on the results of such a manipula- 
tion. Increasing the person’s awareness of his 
inconsistency by, say, explicitly pointing it out 
to him might produce greater changes towards 
consistency. On the other hand, such a pro- 
cedure could well have the opposite effect by 
provoking the person’s hostility or defensive- 
ness, and so result, not in a greater change 
towards consistency, but in some other reac- 
tion such as the rejection of consistency, with 
Carlyle, as the hob-goblin of little minds; or 
in the repression of some of the conflicting 
issues. 


SUMMARY 


Two sets of hypotheses were derived from 
the postulate that people tend to maintain 
consistency among their opinions on logically 
related issues. First, it was hypothesized that 
opinion change would be effected by the 
Socratic method of simply asking the person 
to state contiguously his opinions on logically 
related issues, thus bringing the tendency 
towards consistency to bear on any logical 
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discrepancies among these opinions. Secondly, 
it was hypothesized that a persuasive com- 
munication directed at a person’s opinion on 
some explicit issue tends to change also his 
opinions on logically related (even if unmen- 
tioned) derivative issues in a consistent direc- 
tion. A procedure for quantitatively measuring 
the degree of logical consistency among opin- 
ions on related issues was described. 

The Ss were 120 college students participat- 
ing in three experimental sessions. In the 
first, before-communication opinions regarding 
the subjective probabilities and desirabilities 
of 16 sets of syllogistically related propositions 
were obtained. In the second session, 16 
messages, each arguing for the truth of one 
premise of each syllogism, were communi- 
cated, and the immediate postcommunication 
opinion of the truth of each of the 48 proposi- 
tions was obtained. Each S served in com- 
munication conditions with respect to some 
issues, and control, no-communication condi- 
tions with respect to others. In the third 
session, one week later, the delayed post- 
communication opinions on the truth of the 
48 propositions were obtained. 

With respect to the Socratic effect hypothesis 
a significant amount of inconsistency attribut- 
able to wishful thinking was found, as pos- 
tulated on the first elicitation of the opinions 
(p < .01). One week later, on a second elicita- 
tion of the opinions, there was a significant 
(p = .02) decline in the amount of this wishful 
thinking, indicating that the Socratic method 
did indeed produce the predicted change 
towards consistency. Several propositional 
variables were examined for possible relation- 
ship to the amount of change towards consist- 
ency. It was found that propositions with the 
predicate term (namely, major premises and 
conclusions) contributed significantly more 
(p = .03) of the change towards consistency 
than those without the predicate term. 

With respect to the hypothesis concerning 
logical repercussions, it was found that immedi- 
ately after the communication there was an 
induced opinion gain not only on the explicit 
target issues (p < .001) but also the derived 
issues (p = .02), even though neither the 
derived issues themselves nor even the auxil- 
liary premises needed for their derivation 
were mentioned in the communication. This 
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indirectly induced gain on the derived issues 
fell short, however, of the full amount needed 
for complete logical consistency (p < .05). 
One week later, there was some suggestion 
that the impact of the persuasive message 
had continued to seep down to the derived 
issues: immediately after the messages the 
opinions on the derived issues had shown only 


52% of the logically required amount of 


change, while by one week after, this impact 
had risen to 91% of the required amount 
(p = .10). 
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HE present study was designed to test 

some applications of Festinger’s dis- 

sonance theory (1957) to the area of 
persuasion. Festinger has predicted (1957, Ch. 
9) that a person tends to resist‘ a persuasive 
message to the extent that it tends to increase 
dissonance. The present study tests a some- 
what broader hypothesis. When a person is 
presented with a message that argues for a 
position on an explicit issue which tends to 
bring his opinion on that issue into conflict 
with other, unmentioned but logically related 
issues, he can avoid increased dissonance in two 
ways: (a) by resisting change on the explicit 
issue and (6) by changing also on the related 
(unmentioned) issues. Hence, it is predicted 
that a dissonance-increasing message, as com- 
pared with a dissonance-reducing one, pro- 
duces (a) less change on the explicit issue, and 
(6) more change on unmentioned but logically 
related ones. 

Furthermore, on the assumption that 
changes induced in a consistency-increasing 
direction are stabilized by the bolstering effect 
of the consonant cognitions, it is hypothesized 
that dissonance-reducing changes persist longer 
after the presentation of the communication. 
Hence, it is predicted that both of the con- 
sonant types of change—namely, the changes 
on the explicit issues produced by the con- 
sistency-increasing messages and those on the 
related, unmentioned issues produced by the 
consistency-decreasing messages— show slower 
rates of temporal decay than do the corre- 
sponding dissonant changes. Inconsistencies 
were identified on the basis of an assumed 
tendency toward wishful thinking. 


METHOD 


Procedure. There were three experimental sessions 
held at one week intervals during regularly scheduled 
classes of a freshman English course. Until the post- 
experimental “catharsis,” the study was represented 
as an investigation of the relation between reading 
comprehension ability and the readers’ beliefs regard- 
ing the material read. At each session a questionnaire 
was administered to measure the Ss’ opinions on the 
truth of 24 critical propositions. At the first session 
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the use of the opinion scales for the truth judgments 
and also for measuring the desirabilities of the proposi- 
tions was explained. 

At the beginning of the second session the persuasive 
messages were communicated and memory questions 
were asked about the content (to support the fiction 
that we were studying reading comprehension), and 
then the opinion questionnaire was administered to 
determine the immediate effects of the communications. 
In the third session, the questionnaire was again ad- 
ministered (to measure the persistence of the induced 
opinion changes) and the true nature of the experiment 
was then explained to the Ss. 

Material. The opinion questionnaire consisted of a 
list of 28 propositions. Of these, 24 constituted 8 
syllogisms' of the “Barbara” type (the other four were 
repetitions of propositions to provide a reliability 
check). Beneath each proposition was a scale, described 
elsewhere (McGuire, 1960), on which the S was in- 
structed to indicate his subjective probability regard- 
ing the statement’s truth. Opinions regarding the 
desirability of each proposition were obtained on a 
five-point desirabiltiy scale described by McGuire 
(1960). 

Each persuasive message contained a reasoned 
appeal, about 250 words long, attributed to a (fictitious) 
person with a prestige affiliation, arguing in favor of 
the truth of the premise of one of the eight syllogisms. 

Subjects and design. Each of the 35 college freshmen 
participating in this study received persuasive messages 
on four of the issues and served as a no-communication 
control on the other four issues. 


RESULTS 
Identification of the initial inconsistencies 


In order to test the present hypotheses, it 
was necessary to determine for each message 
whether it was arguing in direction that in- 
creased inconsistency or decreased it. Since 
each message was aimed at raising the rated 
truth of a premise, this determination required 
knowing whether the initial truth rating of the 


1 For a detailed account of the syllogisms and the 
persuasive messages used in this study, see the ADI 
document No. 6235 referred to in McGuire (1960) Fn. 1. 
An example of a syllogism whose premises are rated 
less desirable than the conclusions (Syllogism 8 in 
Table 1) is the following: Even should there be a war 
with Russia, any American city hard to reach by air 
from the European continent will escape nuclear bomb 
attack. Our city is hard to reach by air from the Euro- 
pean continent. Our city will escape nuclear bomb 
attack even should there be a war with Russia. 
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premise had been inconsistently high or in- 
consistently low on the first administration. 
To this end, we postulated a tendency toward 
wishful thinking, amply demonstrated in pre- 
vious studies by Cantril (1938) and McGregor 
(1938), such that the rated truth values of 
propositions are distorted in the direction of 
their rated desirabilities. The eight syllogisms 
were therefore divided into four, the premises 
of which were rated highly desirable relative 
to the conclusions and four with premises 
rated relatively low in desirability. In the case 
of the first four, the persuasive messages were 
interpreted as arguing in an inconsistency- 
increasing direction, and for the second four, 
in a relatively inconsistency-reducing direc- 
tion. 

The data from the first session tend to con- 
firm this auxiliary hypothesis concerning 
wishful thinking. If the person were being 
perfectly consistent, his subjective probability 
of the conclusion should have been directly 
proportional to the product of the subjective 
probabilities of the premises (assuming that 
internal consistency among subjective proba- 
bilities requires their following the axioms of 
probability theory, and that the proportional 
contribution of the premises to the total prob- 
ability of the conclusions remained constant 
from syllogism to syllogism). To the extent 
that, on the contrary, his beliefs were being 
distorted by wishful thinking, the excess prob- 
ability of the conclusions, over the product of 
the probabilities of the premises, should have 
been greater in Syllogisms 5 to 8, than in 
Syllogisms 1 to 4. The data (see Table 1) give 
evidence of a considerable amount of such dis- 
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tortion. For Syllogisms 1 to 4, the mean excess 
probability of the conclusions was only 8.99, 
while for Syllogisms 4 to 8 it was 18.90 
(throughout this report, the subjective prob- 
ability scores cited are the obtained proba- 
bility ratings multiplied by 100, in order to 
avoid confusion with the statistical significance 
levels). The obtained difference of 9.91 is 
significant at the .05 level. (All significance 
levels in this report are based on two-tailed 
analyses of variance tests.) 


Wishful and logical thinking 


While it was not a primary purpose of the 
present experiment to compare these two tend- 
encies, the data make possible an estimate of 
the extent to which wishful thinking and logical 
thinking each affects opinions regarding the 
truth of the conclusions. If a person’s opinions 
are to be logically related, then his rating of the 
probability of the conclusion (p(c)) should 
equal the joint rated probability of the prem- 
ises (p(a)p(b)) plus the rated probability of the 
conclusion on the basis of all circumstances 
other than those specified in the premises. 
Therefore, if we assume that these other cir- 
cumstances contribute about equal proportions 
of the total probability of the conclusion from 
syllogism to syllogism, then consistency re- 
quires that the probability of a conclusion be 
proportional to the joint probability of the 
premises from which it follows. The group data 
indicate that, by this criterion there is a high 
degree of consistency: the correlation between 
the initial p(c) and p(a)-p(6) for the eight 
syllogisms is .74. 

Wishful thinking accounts for a large part 


TABLE 1 
Trutn RATINGS ON THE PREMISES AND THE CONCLUSIONS BEFORE AND AFTER MESSAGES WHICH ARGUED 
FOR THE INCREASED TRUTH OF THE PREMISES AND LEFT THE CONCLUSIONS UNMENTIONED 
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of the remaining variance among the con- 
clusions. When the effects of the desirability 
ratings of the premises and conclusions are 
removed by partial correlation, the correla- 
tion between p(c) and p(a)p() becomes .96, 
which is significant at the .01 level despite the 
smal! number of degrees of freedom. 

Since these and the previously reported 
correlation coefficients are computed from 
group means it is not legitimate to generalize 
to the individual case. However, even on the 
basis of group data it is interesting that 90% 
of the variance among the mean opinions on 
the conclusions can be accounted for with just 
two variables. 


Effects Produced by the Persuasive Messages 


Opinion changes on the explicit issues. The 
first experimental hypothesis stated that the 
persuasive messages would produce opinion 
change on the explicit issues to the extent they 
argued in a direction that would increase con- 
sistency. Since the auxiliary hypothesis con- 
cerning wishful thinking was confirmed, it 
would be predicted on the basis of the first 
hypothesis that a greater increase in the p(a) 
opinions should be produced by the messages 
directed at the relatively undesirable premises, 
than by those directed at the relatively de- 
sirable. 

This hypothesis is supported by the finding 
that the mean increase in probability on the 
desirable target premises (of Syllogisms 1 to 
4 in Table 1) was 14.37, while the mean for the 
undesirable premises (5 to 8 in Table 1) was 
23.65, a difference significant at the .01 level. 
That this greater impact on the undesirable 
premises is not a regression artifact can be 
demonstrated by further analyses. In the 
first place, there is not a great deal of differ- 
ence between the initial probability levels 
of the two sets of premises. The mean proba- 
bility rating of the relatively undesirable 
premises is 50, and of the desirable, 55, which 
is too small an initial difference to account for 
the 10-point differential in effect by statistical 
regression. A still more convincing argument 
against alternative interpretations is provided 
by considering the changes from a baseline 
provided by the control data, that is, changes 
from before to after the communication for 
those subjects not receiving a communication 
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on the given issue. From this baseline, the 
differential is even greater: the mean increase 
induced by the message on the relatively un- 
desirable premises was 20 and on the desirable, 
11. This difference again is significant beyond 
the .01 level. 

Effects of the persuasive messages on the re- 
lated, unmentioned issues. While some studies 
report finding that persuasive communications 
produce little if any effect on related but un- 
mentioned issues (Cooper & Dinerman, 1951; 
Hovland & Mandell, 1952), McGuire (1960) 
has demonstrated that remote repercussions 
in the person’s cognitive arena do occur, 
though usually to a lesser extent than would 
be required for complete consistency with the 
induced change on the explicit issue. The pres- 
ent hypothesis states that such remote impact 
on unmentioned issues is greater for messages 
arruing in an inconsistency increasing direc- 
tion, since otherwise the already dissonant 
op.nions on these derived issues would get 
still further out of line with opinions on the 
explicit issues. Since all the messages in this 
study were designed to increase the rated 
truth of a premise, the prediction follows that 
the increase of the truth ratings of the con- 
clusion produced by the messages aimed at 
the desirable premises (Syllogisms 1 to 4) 
should be greater than those aimed at the un- 
desirable premises (Syllogisms 5 to 8). The 
prediction is supported by the data. The four 
conclusions of the desirable premises show a 
mean increase of 11.15 points after the mes- 
sages, while the conclusions of the undesirable 
premises increase by a mean of only 1.70. 

This finding is especially striking since the 
changes induced on the desirable premises 
themselves were, as has just been described, 
less than the changes on the undesirable prem- 
ises. Assuming the subjective probability 
ratings behave according to the axioms of 
probability theory, we can compute by sub- 
stituting the induced change on the prem- 
ises in the following equation: 


AP(c) = P(a)-AP(b) + P(6)-AP(a) 
+ AP(a)-A4P(b) 


where P(a) and P(d) are the initial opinions on 
the premises. Substituting the obtained 
changes on the premises in this equation, it 
can be computed that, to be consistent, there 
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should have been a mean change of 8.30 for 
the conclusions of the desirable premises and 
17.48 for the conclusions of the undesirable. 
Hence, the obtained changes on the conclusions 
of the desirable premises exceeded the required 
amount by 2.85 whereas the obtained change 
on the conclusions of the undesirable premises 
fell short of the required amount by 15.78 
points, a difference significant at the .01 level. 
The differential is reduced when, to control for 
possible regression effects, the changes in the 
communication conditions are computed from 
a baseline provided by the control (no com- 
munication) conditions, but the reduced differ- 
ence is still significant at the .05 level. 

The relative persistence of consistent and in- 
consistent opinion changes. On the assumption 
that the bolstering effect of consonant cogni- 
tions would stabilize changes in a direction 
reducing inconsistency, it was hypothesized 
that such changes would be more persistent 
than the dissonance-increasing changes. Hence, 
we predicted that as regards the changes on 
the premises themselves, increases on the un- 
desirable premises would be more lasting than 
those on the desirable; regarding the derived 
changes on the conclusions, we predicted, con- 
versely, that increases on the conclusions of 
the desirable premises would be more lasting 
than the increases on the conclusions of the 
undesirable premises. 

Neither of these predictions is supported 
by the data. In regard to the persistence of the 
induced increase on the explicit issues, 7.60 
points of the originally induced 14.37 on the 
desirable premises dissipated during the week; 
while 14.34 points of the originally induced 
23.65 on the undesirable premises dissipated 
(see Table 1). Hence, in terms of the arithmetic 
loss during the week, the dissonance-reducing 
changes show 6.74 points more decay than 
those which increased dissonance, reversing 
the prediction that the dissonance-increasing 
changes would show more decay. In terms of 
percentage loss of originally induced increase, 
the two conditions show approximately equal 
losses at the end of one week. The prediction 
regarding greater stability of dissonance re- 
ducing changes on the remote issues likewise 
is not confirmed by the data. The relatively 
desirable conclusions show a 4.07 point decline 
during the postcommunication week, and the 
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undesirable conclusions, a decline of 6.75 
points. Hence, the dissonance-increasing 
changes are slightly more persistent than the 
dissonance reducing ones when measured in 
terms of amount of decay; and the dissonance- 
reducing changes slightly more persistent when 
measured in terms of rate of decay. 


DISCUSSION 


The confirmation of the hypothesis of greater 
immediate impact of the dissonance-reducing 
communications is particularly striking be- 
cause, in the case of both the explicit and the 
related issues, the tendency toward wishful 
thinking works against it. The implication of 
our finding is that a person can more easily be 
argued into an increased acceptance of the 
truth of an undesirable state of affairs than a 
desirable one. The underlying reason for this, 
suggested by dissonance theory, is that before 
the persuasive communication, he has already 
overrated the truth of the desirable outcome 
and because of his need for consistency, there- 
fore resists further uprating that would make 
his expectation become even more dissonant 
with his cognitions of the available evidence. 

The confirmation of both derivations from 
dissonance theory regarding the immediate 
impact of the persuasive messages but the 
complete nonsupport for the derivations re- 
garding the persistence of these immediate 
effects suggests that predictive power of the 
theory could be improved if a postulate con- 
cerning “salience” were added to it. Such a 
postulate would recognize that all the person’s 
cognitions relevant to a given opinion might 
not be equally brought to bear on the opinion 
in a given situation. Rather, one or another 
subarea within this total cognitive arena 
might be made more salient by experimental 
manipulation. It might often happen that the 
person’s cognition on a given issue could be 
made to appear dissonant in either one direc- 
tion or the opposite, depending on which of 
these subsets of relevant cognitions is made 
salient. 

Applying this formulation to the present 
study which investigates the person’s expecta- 
tions regarding desirable and undesirable 
states of affair, we can detect two cognitive 
forces: wishful thinking and logical thinking. 
The person presumably had initially arrived 
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at a steady state of expectation on the issues 
which represented a compromise between the 
two needs—wish fulfillment and reality orien- 
tation—a compromise that was fully consonant 
with neither. In this study the experimental 
manipulation involved presenting messages 
that contained rational arguments regarding 
the probability of the events in question. We 
can reasonably interpret this procedure as 
producing increased saliency of the cognitive 
realm involving logical thinking, reality orien- 
tation and objective evidence. Hence, any 
underlying dissonance between expectations 
and evidence would be activated, with a re- 
sulting readjustment of the cognitions towards 
greater consonance with the evidence and less 
with desires. 

After the person leaves the experimental 
situation, however, the increased saliency of 
the evidential subarea of his cognitive arena 
should tend to subside to its pre-experimental 
level. Consequently, the need for consonance 
between expectations and desires once again 
becomes relatively more salient, and the dis- 
sonance between these beliefs and hopes is 
now felt behaviorally to a greater extent. By 


adding such a postulate of salience, dissonance 
theory could better fit the present results, both 
as regards initial impact and persistence of 
effects. 


SUMMARY 


Several hypotheses regarding the effective- 
ness of persuasive messages were derived from 
Festinger’s ‘“Theory of Cognitive Dissonance.” 
It was predicted that messages would be effec- 
tive in changing opinions on the explicit issues 
to the extent that they argued in dissonance- 
reducing directions; and on related, unmen- 
tioned issues, to the extent that they argued in 
a dissonance-increasing direction. It was fur- 
ther predicted that the dissonance-reducing 
changes would show a slower rate of temporal 
decay after the communication. 

These hypotheses were tested by asking 35 
college students their opinions regarding the 
truth of several syllogistic sets of premises and 
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their valid conclusions. The Ss were then pre- 
sented with messages which argued for the 
increased truth of the premises (leaving the 
conclusions unmentioned). Immediately after 
the persuasive messages, and again one week 
later, the opinions on the truths of both prem- 
ises and conclusions were again measured. 

On the basis of desirability ratings of the 
propositions and of an assumption concerning 
wishful thinking, the syllogisms were parti- 
tioned into two subsets: those whose premises 
received inconsistently high truth ratings 
initially and those receiving relatively low in- 
itial truth ratings. 

Both predictions regarding the immediate 
impact of the messages were confirmed. On the 
explicit issues (the premises), more opinion 
change was produced by the inconsistency- 
reducing messages (p < .01); on the related, 
unmentioned issues more change was produced 
by the inconsistency-increasing messages 
(p < .05). The prediction of greater perma- 
nence for the dissonance-reducing changes 
was not confirmed: the dissonance-reducing 
and dissonance-increasing changes both 
showed about the same rate of temporal de- 
cay. On the basis of these findings, it was sug- 
gested that a postulate concerning “salience” 
be added to dissonance theory. 
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EJECTION by others has been cited as 
R the determinant of a large number of 
different responses, pathological and 
normal. It is widely believed, for example, 
that social hostility as reflected in delinquent 
behavior stems from a parental environment 
in which the child is somehow isolated and 
unloved (cf. Bandura & Walters, 1958 for a 
recent variant of this view). Rejection has 
also been considered an important basis of 
suicide and other self-punitive responses, as 
well as of social withdrawal and apathy (cf. 
Wolberg, 1944). Finally, a laboratory study by 
Dittes and Kelley (1956) showed that subjects 
(Ss) who were given to believe they had low 
sociometric status in the group conformed 
more to the opinions of that group than those 
who thought they were highly accepted. 

It is not clear from these studies whether 
such diverse responses occur under exactly the 
same conditions of rejection, and if so, whether 
they are displayed by the same person. The 
unanswered question is: what responses can 
be expected from a given kind and degree of 
rejection, and theoretically, why? 

A number of experiments (Pepitone, 1958) 
have shown that the magnitude and the neg- 
ative effects of various “ego-threats” can be 
reduced when the person perceives certain 
causal properties relating to responsibility, 
intentionality, etc. in the threatening situation. 
On the basis of these results, a theoretical 
process may be proposed to account for some 
of the responses reported to be associated with 
rejection. The assumption is made that re- 
jection from an important group generates 
more or less severe personal “devaluation 


1 This experiment is one of a series supported by the 
Office of Naval Research through a contract with the 
senior author N-ONR 551-(27). We gratefully acknowl- 
edge this support, although we must take full responsi- 
bility for the views expressed in this paper. We have 
also benefited from a critical reading of the manuscript 
by J. Aronfreed and S. Feshbach. We wish to acknowl- 
edge the help of R. C. Mackowiak in parts of the 
statistical analysis. 


anxiety” in most individuals, and that more or 
less strenuous attempts are made to reduce 
such anxiety. Anxiety-motivated instrumental 
behavior may be primarily directed toward 


‘ changing the social environment. For example, 


rejected persons may comply or identify with 
the rejector’s opinions, become compulsively 
friendly, withdraw from the group, or display 
various other socially adaptive responses. How- 
ever, when socially adaptive responses are 
unsuccessful, impossible, or when for any 
reason, the anxiety is irreversible, the person 
then attempts to make the anxiety more 
toleravle to himself. In other words, when the 
rejection or other basis of devaluation anxiety 
is a fait accompli, persons tend to respond in a 
way that is primarily self-adaptive in function. 

The basic assumption of the present experi- 
ment, suggested by the studies referred to 
above, is that the person can make irreversi- 
ble devaluation anxiety more tolerable by 
explaining or fixing responsibility for it. Spe- 
cifically, it is hypothesized that “causal attibu- 
tion” to the two general sources of responsi- 
bility is reflected empirically in two kinds of 
responses: 

1. Self-depreciation. An “internal” attribu- 
tion of responsibility for the rejection is shown 
by some form of self-depreciation on the part 
of the rejectee. Such a response in effect ac- 
knowledges or “accommodates” the personal 
devaluation implicit or explicit in the rejec- 
tion and thereby makes it more bearable. 

2. Hostility. Various forms of aversion to- 
ward the social environment may be regarded 
conceptually as an external attribution of re- 
sponsibility for the rejection. Thus, by dis- 
liking the rejectors, by finding fault with them 
and the social situation generally, the person 
“accounts” for the rejection, and palliates 
the devaluation anxiety generated by it. 

An important question is whether the two 
hypothesized responses to rejection tend to 
occur in the same person. From one theoretical 
standpoint, it may be supposed that if in the 
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past the person has succeeded in lowering re- 
jection anxiety by internal attribution (self- 
depreciation), he should be less likely to pro- 
duce an external attribution (hostility). That 
is, the two responses should correlate nega- 
tively. On the other hand, causal attribution in 
general may have been “reinforcing,” in which 
case all responses in that class would tend to 
be positively correlated in individuals. From a 
different theoretical slant, a positive correla- 
tion would result if the relative contributions 
to the rejection are uncertain. Where the per- 
son is not sure of the specific basis for being 
rejected, he might be expected to implicate 
both himself and the rejectors. Thus, an un- 
equivocal prediction regarding the relation- 
ship among responses to rejection is not possi- 
ble. 

There are of course, all degrees of rejection. 
On the assumption that stronger rejection 
would tend to produce more vigorous self- 
adaptive responses, an attempt was made 
operationally to establish Implicit and Ex- 
plicit levels of rejection. As it turned out em- 
pirically, the differences in effects between the 
two conditions were negligible. Ss under the 
two conditions were, therefore, combined for 
purposes of analysis. 


METHOD 


Ninety-five college undergraduate male volunteers 
served as Ss. Each three-person group consisted of one S 
and two confederates.? The behavior of the confederates 
was completely prescribed in advance by a well re- 
hearsed script. 

After assembling the group in the laboratory, EZ 
informed the Ss that they were going to take part in 
an important survey concerned with the breakdown of 
family structure. The purpose of the study was to 
investigate the attitudes of college undergraduates 
toward two issues relevant to family disorganization: 
(a) premarital sex relations, and (6) drinking among 
minors. Specifically, Ss were asked to reflect upon the 
question: “Do you, in general, encourage or discourage 
premarital sex relations?” and to jot down the three 
strongest reasons for their opinion. Lots were then 
ostensibly drawn to determine who would discuss his 
views first. However, the drawing was so contrived 
that S always led off. He declared himself either pro 
or con on the question and then elaborated his views 





2 There were actually four individuals who played 
the two confederate roles approximately the same 
number of times in each experimental condition. 
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by citing the reasons he held to be decisive. Finally 
he was requested to raise up in full view one of a series 
of cards numbered 1 to 6, representing the degree to 
which he personally either encouraged or discouraged 
premarital sex. The identical procedure was followed 
by the two confederates. In the No Rejection condition, 
the confederates held the same opinions as S, and 
assigned approximately the same numerical weight to 
them as did S. In both the Implicit and Explicit condi- 
tions, the confederates differed from S by adopting the 
diametrically opposed opinion. The reasons given by 
the collaborators and the numerical strength of their 
opinions were such as to maximize the isolation of S 
while maintaining the naturalness of the situation. 

Following the 15-min. discussion, the groups under 
the rejection conditions were given a brief recess before 
taking up the second topic. The Explicit Rejection 
procedure took place during this recess. EZ left the room 
mentioning that he had forgotten some forms. The two 
confederates then became engaged in friendly con- 
versation. They were instructed to ignore or to respond 
curtly to any of S’s questions or remarks, and to present 
an unfriendly demeanor when glancing at S. A number 
of specific actions reinforced the personal rejection. For 
example, one confederate offered a cigarette (if S was 
a smoker) or chewing gum to the other, but pointedly 
neglected S. In the Implicit Rejection, there was silence 
during this period, unless S initiated conversation, in 
which case the responses of the confederates were 
minima! and neutral. 

When £ returned, he reluctantly announced that 
time prevented any further discussion of the topic; 
that it was a closed issue. He added that he would 
like to have them make a few ratings which would be 
helpful in interpreting the discussion. Following this 
rating procedure, Ss were asked whether they would 
like to return for a discussion of the second topic with 
the same group. Assurance was given to each S through- 
out the postdiscussion procedure that his answers 
would be treated anonymously and confidentially. No 
S would know how the other responded. After collection 
of these data, EZ informed the S of the nature of the 
experiment, exposed the dissimulation, and requested 
that he refrain from discussing it with others. 


RESULTS 
Adequacy of Experimental Operations 


Two questionnaire items may be examined 
for evidence that the manipulations had the 
desired effects. In one case, S is asked to make 
a quantitative rating of the degree he believes 
he is liked by the other Ss. Responses were re- 
corded on a 20-unit scale ranging from “dislike 
me very much” on the low end to “likes me 
very much” on the high. The second item re- 
quests a rating of how valid he thinks the 
other Ss regard his opinions as expressed in 
the discussion. Responses on this 20-unit scale 
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could range from “highly invalid” on the low 
end to “highly valid” on the high. 

Table 1 contains the mean ratings of the 
two items under the experimental conditions. 
It is clear that the experimental treatments 
had a significant effect on both measures. Re- 
jected Ss felt less well-liked personally and 
felt that their opinions were thought to be 
less valid than nonrejected Ss. It can also be 
seen that while the means drop regularly 
across conditions, consistent with the assump- 
tion of a quantitative continuum of rejec- 
tion, the differences between the Implicit and 
Explicit Rejection conditions are not statis- 
tically significant. Subsequent analysis will 
deal, consequently, with the Implicit and 
Explicit Ss combined into a single Rejection 
condition. 


Self-Depreciation 


As hypothesized above, an irreversible re- 
jection can be made more acceptable by taking 
the blame for it, that is, by lowering the self- 
evaluation in some degree in some circum- 
scribed area. Two sources of data were assumed 
to be relevant to the hypothesis. One is a 
listing of positive and negative personality 
characteristics that all but a few Ss provided 
following the questionnaire. The specific meas- 
ure of this “personal” self-evaluation consists 
of the number of positive attributes minus 
the number of negative ones. The second 
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TABLE 2 
MEAN RatTINGs ON MEASURES OF SELF-DEPRECIATION 
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measure of Self-depreciation is based on a 
quantitative rating by S of the validity of his 
opinions expressed in the discussion. Responses 
were recorded on a 20-unit scale. 

Table 2 contains the average discrepancy 
between S’s positive and negative self-evalua- 
tions, and the mean ratings of the validity of 
his opinions under the two experimental con- 
ditions. It is evident that the average number 
of S’s positive self-evaluations as well as the 
estimate of the validity of his opinions were 
lower in the Rejection than in the No Rejection 
condition. Thus, whether ie evaluation of 
himself is in personal terms or in terms of the 
worth of the opinions he holds, rejection defi- 
nitely produces Self-depreciation. 


Hostility 


An external attribution of responsibility 
hypothesized to follow from rejection can be 
reflected in many forms of social aversion or 
hostility. Several measures were developed 
for different aspects of this general response 
tendency. To get at personal hostility, Ss were 
asked to rate how much they liked or disliked 
each of the two confederates. A more im- 
personal measure of hostility called for S to 
rate the validity of each of the confederate’s 
opinions. As an indirect measure of social 
hostility, S rated his amount of interest in the 
discussion issue. All of these ratings were on 
20-unit scales. Finally, the most general meas- 
ure of hostility was a quasi-behavioral one—a 
statement of whether or not S would commit 
himself to return for another discussion with 
the same discussion partners. 

Table 3 shows the mean ratings on the three 
scales and the percentage of Ss who express a 
willingness to return, under the two experi- 
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mental conditions. All four measures assumed 
to represent hostility were significantly affected 
by rejection. Not only were the rejectors’ 
opinions held to be less valid, but personally 
the rejectors were more disliked than the dis- 
cussion partners were liked by Ss in the No Re- 
jection condition. Moreover, Ss who had been 
rejected found the discussion topic less in- 
teresting than the Ss in the No Rejection con- 
dition. Finally, for whatever reason, many 
more rejected Ss were averse to returning for 
another discussion. In all, there is little doubt 
that the Ss who were implicitly or explicitly 
rejected find their social environment rela- 
tively unattractive. 

We turn now to a consideration of whether 
the responses produced by the experimental 
conditions are in any way related to each 
other within the same individual. Table 4 con- 
tains the product moment coefficients of cor- 
relation among the responses under each ex- 
perimental condition. With respect first to the 


TABLE 3 
MEASURES OF VARIOUS ForMs or HOSTILITY 


No Re- Re- 
jection | jection p 

N= N= (one-tail) 
32) 63) 


18.3 < .03 


Personal liking of | 21.3 
others’* 

S’s evaluation of 
others’ opinions» 
S’s interest is dis- 

cussion 
S’s commitment to 


return 


28.4 | 17.6 < .001 


15.4 | 13.8 < .03 


69% | 30% <.03 


* Ratings of the two confederates combined 
> Ratings of the two con/ederates combined 
© Chi square with 1 d/ 
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Rejection condition, there are two trends worth 
noting: (a) self-evaluation, in terms of the 
validity of one’s opinions, tends to be posi- 
tively correlated with each of the three meas- 
ures of Hostility, and (0) liking others is 
positively correlated with one other measure 
of Hostility, namely, the evaluation of the 
opinions others hold. 

Of greatest theoretical interest are the sig- 
nificant positive relationships between one of 
the Self-depreciation measures and two meas- 
ures of Hostility. To determine whether the 
experimental rejection had anything to do with 
this result, the correlations under the No Re- 
jection condition may be examined. It will 
be noted that the correlations between Validity 
of Own Opinion and each of the three Hostility 
responses, like their counterparts under the 
Rejection condition, are positive (two of the 
coefficients are at least at the .10 level). It 
thus seems fairly clear that a positive relation- 
ship between Self-depreciation and Hostility 
does exist for some measures, but is not due 
to the effects of experimental rejection. 

In fact, when the differences between the 
correlation coefficients are tested for sig- 
nificance (by way of the s transformation and 
the SE of the difference between z scores), 
none of them is significant. In other words, no 
correlation seems to have been affected by 
the experimental rejection. Thus, the rela- 
tionship between liking others and the eval- 
uation of their opinions is significantly posi- 
tive under the No Rejection condition as it 
was under the Rejection condition. It should 
also be noted that although the difference be- 
tween experimental conditions falls short of 
significance, two of the correlations in which 


TABLE 4 


CorrRELATIONS AMONG RESPONSES PropucED UNDER REJECTION AND No Reyjecrion CONDIvIONS 


Valid own Opinion 
No Rej. 
— .02 
+ .36 


Personal Self-Eval.* 
Valid own opinion 
Like others 

Valid other opinion 


Like Others 


Valid Others Interest 
No Rej Rej 
+.39 
+ .43* 
—.12 


+.12 


Rej. No Rej. Rej 


— .05 
+ .28* 
+.21 
+ .03 


— .03 
+ .26 
+ .43* 


—.15 
+ .18 
+ .37* 


— .05 
+ .46* 





* These coefficients are at least of the size required for the .05 level of significance based on both tails of the ¢ distribution. 
* Note that correlations involving this measure are based on NV of 21 and 41 in the No Rejection and Rejection conditions re 


spectively. All other measures are based on Ns of 32 and 63 





CONSEQUENCES OF REJECTION 


Personal Self-evaluation is involved tend to 
be strongly positive (p < .10) under the No 
Rejection condition. There is no ready ex- 
planation of this, assuming ‘hat it is not merely 
a chance pattern. 

DISCUSSION 


There seems little doubt that rejection pro- 
duces an increase in the strength of both self- 
depreciation and hostility in the sample of 
Ss employed. Such findings are consistent 
with the hypothesis that an unchangeable re- 
jection tends to bring about self-adaptive 
attributions of causality. Since, however, the 
experiment cannot prove the hypothesis un- 
equivocally, it becomes necessary to summarize 
the arguments that support the hypothesis 
and oppose alternative interpretations. 

Considering the finality of the rejection, the 
clear impossibility of communicating with the 
rejectors, and the anonymity of the responses 
it may be assumed that the behaviors pro- 
duced by rejection did not primarily have a 
socially adaptive function. Thus, concretely, 
it appears to us to be inadequate to interpret 
the hostility merely as inhibited aggression 
toward the rejectors. Similarly, under the 
circumstances, self-depreciation could hardly 
have been produced in order to gain the re- 
jectors’ sympathy and thereby to rectify the 
damage to the self. On the other hand, it 
seems more plausible to suppose that the man- 
ifestly irreversible rejection strongly inclined 
the S to adjust himself to the negative situa- 
tion rather than to try to change the situation. 
Although presumably there are other kinds of 
self-adaptive responses, hostility and _ self- 
depreciation appear to represent attributions 
of responsibility most closely. Clearly more 
definitive criteria of self-adaptive causal at- 
tributions are needed for further confirmation 
and extension of the theory. 

It is important to point out that there were 
sources of devaluation anxiety in the experi- 
ment in addition to the rejection itself. For 
example, the discussion topic of premarital 
sex relations undoubtedly was a source of 
strong anxiety in many Ss under both experi- 
mental conditions.’ It is quite probable that 

* Concerning this topic, the breakdown “For” and 
“Against” was approximately the same in all experi- 
mental conditions. 
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the shame engendered in this way produced 
self-depreciation and even some hostility. In 
addition, there is the possibility that the very 
articulateness and urbanity of the confederates 
in agreeing and disagreeing with the Ss made 
some of the latter feel extremely inadequate. 
This state of affairs might very well have 
created anger toward the confederates and 
attendant guilt, which in turn could have pro- 
duced some self depreciation. Here again, how- 
ever, granting our “rationalistic” classifica- 
tion of responses into socially and self-adaptive 
categories, there would not seem to be much of 
a basis for such “retaliatory” anger. 

Given the motivation to make the self- 
adaptive responses of hostility and self-depre- 
ciation in both experimental conditions, what 
accounts for the fact that seemingly alternative 
self-adaptive responses tend in some cases to 
be positively correlated? One compelling possi- 
bility may be mentioned. In the concrete ex- 
perimental situation of the S, it was not at all 
clear who was responsible for the rejection. It 
may be supposed that when the causal situa- 
tion is basically uncertain, persons who are 
motivated to make self-adaptive responses 


tend to “spread” the responsibility over the 
major possible causes. Such a tendency would 
make for positive correlations among the re- 
sponses. 


SUMMARY 


The experiment was designed to test the 
hypothesis that irreversible rejection gives 
rise to attributions of responsibility for that 
rejection, toward the self (self-depreciation) 
and toward the rejectors (hostility). The 
theoretical basis for the hypothesis was that 
personal devaluation anxiety engendered by 
the rejection can be made less unacceptable 
to the self through a causal analysis of it. 
The data were consistent with the hypothesis. 
A correlational analysis disclosed that a posi- 
tive correlation existed be» ween one of the self- 
depreciation responses and two hostility re- 
sponses under both conditions of Rejection 
and No Rejection. The discussion pointed to 
sources of devaluation anxiety other than re- 
jection, and to the supposed tendency for the 
person to spread responsibility in a situation 
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where the cause of the anxiety is relatively un- 
clear. 
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HE purpose of this experiment was to 
study the effect of stimulus pooling on 
judgments of personal attractiveness, 
where, in this con‘ext, stimulus pooling refers 
to the assembling of stimulus components into 
composite stimuli. An understanding of the 
relations between judgments made to com- 
posite stimuli and judgments made to com- 
ponents presented alone is of fundamental 
importance for the psychology of judgment. 
It has, in addition, important implications for 
social perception, e.g., for the development of 
social stereotypes and the formation of im- 
pressions of others. 

Very briefly, the method employed was 
first to scale identification photographs on 
personal attractiveness, and then to assemble 
these photographs into sets of two or three. 
These sets, or composite stimuli, were then 
judged for overall attractiveness by additional 
judges. 

One particularly simple kind of relationship 
that might be expected to hold between meas- 
ures of judgments made to composite stimuli 
and those made to the stimulus components 
presented alone is that the former are equal 
either to the mean or the sum of the latter, 
depending upon the kind of task set for judges. 
Following Gulliksen (1956), this “linearity” 
or “additivity” of pooling effects may be 
derived by starting with the assumption 
that the rate of change of subjective value, », 
with respect to an objective stimulus measure, 
x, is constant. This corresponds to the differ- 
ential equation 


do/dx = k [1] 


1 This paper is based on a dissertation submitted 
to the graduate school of the University of Wisconsin 
in partial fulfillment of the requirements for the PhD 
degree. The writer wishes to express his sincere appre- 
ciation to J. C. Gilchrist, under whose supervision this 
investigation was conducted, and to David A. Grant 
and Leonard Berkowitz, who served as members of 
the author’s doctoral research committee. 

* Now at the Carnegie Institute of Technology. 
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Upon integration and setting x = x» when 


v = O, we obtain 
v= k(x — xp) [2] 
For stimulus components i and 7, and com- 
posite stimuli 47, 
1 = k(x; — Xo) [3] 
vj = k(x; — x) [4] 
053 = R(xy + 2; — Xo) [S] 


By eliminating x; and x; between Equations 
3-5, we have 


03 = 1% + 05 + Rx [6] 


Since scaling data give, not the subjective 
values, v, but scale values, s, which differ 
from the former by a constant, we substitute 
aig = C= %,5;-C= vj, and 5; — C = D453. 
The resulting expression is 


Sig = 54 + 55 + (Rx — C) [7] 


Because judges in the present experiment 
were requested to respond to the mean of 
component scale values, rather than to their 
sum, we rewrite Gulliksen’s equation (1956) 
as follows 


ss = (at * “) +K (] 


This indicates that a plot of the scale values 
of composite stimuli against mean scale values 
of stimulus components will be linear with 
intercept K = }4 (ka — C) and—note—unit 
slope. When kx — C = 0 in Equation 7, or 
K = 0 in Equation 8, we may speak of additiv- 
ity, whereas the term linearity may be reserved 
for the more general relationship. 

Gulliksen (1956) working with food prefer- 
ences and using the method of pair compari- 
sons, found Equation 7 to fit his data as well 
as three more complex functions, viz. logarith- 
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mic, square root, and negative exponential. 
Rimoldi (1956) working with “interest ‘in 
knowing’ the person or persons” and using 
three different methods of scaling, found 
correlations between obtained scale values of 
composite stimuli and those predicted from 
an additive relationship (Equation 8 with 
K = 0) to be .93, .97, and .96. 

On the other hand, what is known about the 
nature of social stereotyping suggests that 
under certain conditions, judgments made 
about composite stimuli (social groups) may 
not be so simply related to the mean judgments 
made about stimulus components (individual 
members of the groups involved).* 

Campbell (1956), for example, has derived 
a principle of “enhancement of contrast” 
from the concept of stimulus generalization, 
whereby the prediction is made that the 
extent of overlap between two groups will be 
underestimated at intermediate levels of 
learning, due to a reduction of the variances 
of judgments within a group. Generalization 
among members of a group causes those 
relatively high on the continuum to be un- 
derestimated, while those relatively low are 


overestimated. This prediction was verified ex- 
perimentally, using nonsense syllables. Clarke 
and Campbell (1955) found a slight degree of 
such relative exaggeration of group differences 
for white students’ estimates of the examina- 
tion scores of their Negro and white class- 


mates. 
However, Campbell’s derivation applies to 
judgments of individual group members, not 
to judgments of the groups as such. A study 
which demonstrates the operation of enhance- 
ment of contrast in judgments of groups is that 
of Kephart (1954). He found a marked tend- 
ency for policemen to exaggerate the Negro- 
white crime rate differential in their districts. 


*The situation is complicated—as, for example, 
Allport (1954) has pointed out—by the fact that social 
stereotypes often derive, not only from contact with 
group members, but also in large part from attitudes 
assimilated from nonmembers, especially those belong- 
ing to the individual’s reference groups. Furthermore, 
systematic distortion of memory traces, so well docu- 
mented by Bartlett (1932), plays an important role in 
the development of stereotypes. An analysis of such 
effects, however, is outside the scope of the’ present 
paper 
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Interestingly, the higher the actual arrest 
rate in the district, the greater the tendency 
towards overestimation. This fact by itself 
does not imply that pooling effects are neces- 
sarily nonlinear, but it does imply that Equa- 
tions 7 and 8 are not sufficiently general. In 
order to obtain the kind of enhancement of 
contrast found by Kephart, a slope greater 
than unity is required. 

So far as the author is aware, no prior in- 
vestigation has made a direct comparison of 
judgments of composite stimuli and those of 
stimulus components and at the same time has 
employed stimulus material somewhat analo- 
gous to that upon which judgments of social 
groups and their members is based. The main 
question to which the reported research was 
directed may be formulated as follows: 

Are the processes involved in responding to 
composite stimuli by themselves sufficient to 
yield the exaggeration of group differences so 
often found in social sterotypes? 

In the absence of any precise knowledge of 
the weighting laws involved, it seemed desir- 
able to include as independent variables, in 
addition to differences in means of composite 
stimuli, such “intragroup” factors as heter- 
ogeneity and skewness of component scale 
values about composite stimulus means. 
Because interest was centered about pooling 
effects, it seemed obligatory to include as an 
independent variable the number of com- 
ponents per composite stimulus. Presumably 
any effects due to stimulus pooling, such as 
enhancement of contrast, should be accentu- 
ated by increasing the number of components 
pooled. Certain other variables, such as sex of 
judges, sex of photographic subjects, and order 
of presentation, were introduced primarily as 
experimental controls. 

And finally, one additional variable, the 
number of judges working together, was in- 
cluded for the purpose of testing a specific 
hypothesis. It was expected that pairs of 
judges working together, making a single, 
collective response to each stimulus, would 
produce distributions of judgments with less 
dispersion than those produced by individual 
judges. It was hypothesized that a process of 
social accomodation would occur, whereby 
partners would reconcile differences in judg- 
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ments by compromising, thereby avoiding 
extreme judgments. 


METHOD 


Subjects. The Ss were 212 University of Wisconsin 
undergraduates recruited from elementary psychology 
classes during the academic year 1956-57. Thirty-two 
Ss served as judges during the scaling of the stimulus 
components, while 180 Ss participated in the judging 
of the composite stimuli. 

Scaling the stimulus components. Approximately 
1,500 photographs, 144 in. square, of students entering 
the University in the fall of 1950, were obtained. From 
this initial group, two subgroups of 200 each were 
selected of males and females. Each subgroup included 
relatively high proportions of extremely attractive and 
extremely unattractive individuals. 

Values on a scale of personal attractiveness were 
obtained for each of the 400 photographs by means of 
the method of equal-appearing intervals. The judges, 
16 males and 16 females, used a sorting rack containing 
nine categories labelled A through I. A front cover on 
the rack prevented judges from observing the number 
of photographs placed in each category, thus increasing 
the independence of judgments. No restrictions were 
placed on the number of photographs to be placed in 
each category. Instructions (cf. Willis, 1957) were read 
by the judge, and then £ explained the main points of 
the task. One of the two groups of 200 photographs 
was sorted first, then the other. Order of presentation 
was balanced both for male and female judges. 

Experimental design: Overview. Theentire experiment, 
subsequent to the scaling of components, may be 
considered and analyzed either as three 2* X 4 orthog- 
onal factorials,‘ or as two interlocking 2 X 4 orthog- 
onal factorials. In all, there were eight factors, of which 
seven had two levels and one (heterogeneity) had four. 

Assembling the composite stimuli. Once the scale 
values of attractiveness were obtained, photographs 
were assembled into composite stimuli, or sets, and 
mounted on 3 X 5 index cards. Each set consisted of 
either two or three photographs (2-sets and 3-sets, 
respectively). Sets were systematically varied with 
respect to the following variables: (a) set size—the 
number of photographs per set, (b) set mean scale value 
(MSV)—the mean attractiveness scale value of set 
components, (c) helerogeneity—mean deviation of 
scale values about MSV, (d) absence or presence of 
skewness of the distribution of scale values about MSV, 





‘There were two error terms, Error Term I (MS, 
Js within conditions), and Error Term II (MS, resid- 
ual). Error Term I was used to test the significance of 
differences between independent groups, while Error 
Term II was used with tests based on possibly correlated 
data, i.e., tests of sources of variation involving one or 
more of the factors of heterogeneity, MSV, and sex of 
photographic subjects—and thus also involving re- 
peated measures. The df for Error Term I was equal 
to 32 times the number cf series involved, while the df 
for Error Term II was ejual to 480 times the number 
of series involved. 
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and (e) sex of photographic subjects, all photographs in 
a set being of the same sex. 

Assembling the subseries and series. A number of 
sets were assembled into a subseries, a subseries being 
the sets given to a judge to sort at one time. All sets 
within a subseries were of the same size and sex. Each 
judge was required to sort matching male and female 
subseries, and order of presentation was balanced for 
each group of judges. There were three pairs of match- 
ing subseries, each pair consisting of a male and a female 
subseries. Each pair of matching subseries formed a 
series. Series I contained 2-sets. Series II contained 
symmetrical 3-sets, and Series III contained highly 
skewed 3-sets, where the measure of skewness was twice 
the intermediate scale value minus the sum of the 
other two. Within each MSV condition, skewness was 
always in the same direction; in the high MSV condi- 
tion, the most extreme component in :-ach set was also 
the one with the highest scale value, whereas the most 
extreme component in each low MSV set was the one 
with the lowest scale value. 

Within each series, sixteen set types were represented. 
These set types were formed by the various combina- 
tions of male and female photographic subjects, high 
and low MSV, and four levels of set heterogeneity. 
Each set type in Series I was represented by six sets. In 
Series IT and ITI, each set type was represented by four 
sets, with the exception of two set types which were 
represented by two sets. The six subseries were all 
matched, set type by set type, on both MSV and 
heterogencity. In general, it was possible to achieve 
excellent matching. 

Obtaining the judgments of composite stimuli. Sixty 
judges were used with each series, each judge being 
used with only one series. In half the sortings, a single 
judge responded to each set, using the same sorting 
rack used in the scaling. In the ‘ther half of the sortings, 
two judges of the same sex worked together in an 
informal way, making decisions about the placement of 
sets into the sorting rack. Judges working alone read a 
set of instructions which were identical with that used 
in the scaling, except for the addition of an instruction 
to take all photographs of each set into account in 
making judgments. Instructions read by partners. were 
identical to those read by set judges working alone, 
except for an additional item instructing partners to 
alternate in the actual handling of sets, while deciding 
together about the placement of each set. Z reviewed 
the important points in the instructions for judges 
before they began, just as was done during the scaling. 

Sixteen scores were obtained from each J,® one for 
each set type. The score for a set type was the mean 
score for all sets of that type, while the score for a set 
was merely the number of the response category into 
which it was placed, categories being numbered 0 to 8, 
from least to most attractive. 





5 The word judge will hereafter be used to designate 
the individual, either working alone or with a partner, 
while the symbol J will be reserved to refer to the 
judging unit, which may consist of either one person or 
two. A J corresponds to a subject for purposes of 
statistical analysis. 
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Fic. 1. The design of the experiment: the sixteen set types within a series, the eight subgroups of Js within a 
series, and the relationships between series with respect to set size and presence of skewness within sets. 


Experimental design: Conditions. For each series, Js 
were balanced with respect to sex and number of 
judges per J. Order of presentation of subseries was 
also balanced. The entire design, which is diagrammed 
in Fig. 1, was replicated five times. In Fig. 1A are 
shown the sixteen set types that occurred in each series. 
There were eight experimental conditions for each series, 
each containing five Js, as shown in Fig. 1B, making 
120 Js in all. Fig. 1C diagrams the interrelations be- 
tween series. SeriesI and II are both free from skewness 
within sets, but differ in set size. Series II and III have 
set size in common, but the sets in Series III are skewed 
to the full extent allowed by the specified heterogeneity. 
Series I and III thus differ both in size of set and pres- 
ence of skewness. All three series were matched on 
each of the six factors shown in Figs. 1A and 1B. Each 
series, taken separately, forms an orthogonal design in 
these six factors. Series I and IT taken together form an 
orthogonal design in these six factors plus set size, 
while Series II and III taken together form an orthog- 
onal design in these six factors plus skewness. Hetero- 
geneity, MSV, and sex of photographic subject are 
within-J factors; number of judges per J, sex of judges, 
and order of presentation exist between conditions 
within series; and set size and skewness are between- 
series factors. 


RESULTS 


Overall main effects. In Table 1 are shown the 
subgroup means obtained when data from all 
three series* were subdivided by levels of each 
factor in turn. The results of the tests of signifi- 
cance are reported in Table 2 (See Footnote 
4). Heterogeneity, set MSV, and sex of photo- 
graphic subjects, tested against Error Term 
II, are all highly significant. None of the re- 
maining overal] main effects approach signifi- 
cance. 


*There are two exceptions: the set size and the 
skewness comparisons were based on data from two 
series only (Series I and II, and Series II and III, 
respectively), to avoid confounding the set size and 
skewness effects within a comparison. See Fig. 1C. 


Set heterogeneity in moderate degree showed 
no effect, but mean set judgments were 
slightly lowered in the condition of maximum 
heterogeneity. A Duncan Range Test (Dun- 
can, 1955) indicated that the three highest 
means were not significantly different from 
one another, but that the lowest mean was 
significantly lower than the next lowest mean, 
at the .01 level. Consequently, a “cluster- 
straggler” conclusion is accepted. Two facts 
stand out in Table 3, the second more interest- 
ing than the first: 

1. Mean set judgments are slightly higher 


TABLE 1 
MEAN JupGMENTS OF SETS WITHIN CONDITIONS* 








Heterogeneity of set components 
Minimum 
High 
Maximum 

Set mean scale value 


iw 


High 

Sex of photographs 
Male 
Female 

Number of judges per J 
One 


ee ke &S Seee 


Two 
Sex of judges 
Male 
Female 
Order of presentation 
M-F 


F-M 
Set size 
Two 
Three 
Skewness 
No skew 
Skew 


wb wR ww& 
an an 


a 





i) 
an 





® Based on all data, with exceptions, specified in Footnote 6 
in which comparisons were based on data from two series only. 
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TABLE 2 
SIGNIFICANCE Tests OF Main Errects* 








Source of Variation MS 





5.88 
1055.50 


Heterogeneity 

Set mean scale value 
Sex of photographs 4.59 
Number of judges per J 9.15 
Sex of judges in J 7.83 
Order of presentation 3.77 
Set size 4.19 
Skewness 4.13 





* Based on all data, with exceptions, specified in Footnote 6 in 
which comparisons were based on data from two series only. Error 
Term II was used to test heterogeneity, set mean scale value, and 
sex of photographic subjects, while the other main effects were 
tested against Error Term I. See Footnote 4. 


TABLE 3 


COMPARISONS WITH RESPECT TO MEAN SCALE VALUE 
AND MEAN SET JUDGMENTS 








Series III Overall 


| 
Series I | Suton as 
| 





(Level of 
MSV) 


Low | High 


(Level of 
MSV) 
Low | High | Low | High 
SE 
| 
2.14 | 4.15 | 2.16 | 4.16 | 2.45 


2.04 4.71 | 1.91 4.61 2.02 | 


(Level of (Level of 
MSV) MSV) 
| 

Low | High 





MSV | 2.14 | 4.15 
Ms] 2.10 | 4.42 


4.15 
4.58 











than the mean of the scale values of the set 
components. 

2. The sets of low MSV were judged below 
their MSV, while the sets of high MSV were 
judged above their MSV. The drop of the 
low sets was 0.13 and the rise of the high sets 
was 0.43, so that the gap between the two 
increased from 2.00 to 2.56, or about 25%. 

The overall difference between MSV and 
mean set judgment was significant beyond 
the .001 level, as indicated by a ¢ test. Addi- 
tional ¢ tests indicated that the drop in the 
judgments of low sets, the rise in the judg- 
ments of high sets, and the increase in the gap 
between conditions were all significant beyond 
the .001 level. It is not likely that these 
effects could be due to a biased selection of set 
components from the full scale, for the grand 
mean for the scale values of all 400 photo- 
graphs was 2.95, which is not far from the 
mean MSV of 3.15 for all sets. Furthermore, 
the direction of this difference would favor a 
downward shift, contrary to the results ob- 
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tained. Slightly more than half of the total 
number of component scale values on the full 
scale were between 2.15 and 4.15, the levels 
for the low MSV and the high MSV conditions, 
respectively, so the increased gap between 
conditions cannot be accounted for by the 
possibility that too small an initial difference 
in MSV levels was selected. 

Since judgments of sets are systematically 
different from the mean of judgments of set 
components, we conclude that the condition 
of additivity, as Johnson (1955, pp. 306-308) 
calls it, which has also been referred to as 
“the linear hypothesis” by Rimoldi (1956), 
does not hold for stimuli of the type used. 
Johnson and Rimoldi refer specifically to the 
situation described by Equation 8 with K = 0, 
but because of the pronounced enhancement of 
contrast found, we may conclude that Equa- 
tion 8 does not hold, no matter the value of K, 
for enhancement of contrast impiies a slope 
greater than unity. 

The remaining significant overall main 
effect is that for sex of photographic subjects. 
Sets of males were rated slightly higher than 
sets of females. Further results, discussed below, 
indicate that the interaction of sex of photo- 
graphic subjects with sex of J is more pro- 
nounced than the main effect. 

Analysis of variance. The data were subjected 
to extensive treatment by analysis of variance 
(cf. Willis, 1957). Briefly, the analyses con- 
sisted of (a) an analysis of Series I and II 
taken together, (6) an analysis of Series II 
anc ITI taken together, and (c) an analysis ot 
each of the series separately. The purpose of 
the first of these analyses was to test the signif- 
icance of tne set size factor and its interac- 
tions with certain other variables. The second 
analysis was performed in order to make 
similar tests about the skewness factor and its © 
interactions. A primary advantage of the 
series-by-series analyses is that they make it 
possible to take into account the consistency 
of results running through three independent 
groups of Js, an important consideration. In 
fact, it seems reasonable to give as much or 
more weight to consistency as to level of 
significance per se. Because of the large number 
of F ratios computed (63 per series), very 
conservative standards were adopted for 
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accepting a finding as significant. The null 
hypothesis was rejected only for those sources 
of variation which were significant at or 
beyond the .05 level for all three series.” 

The analyses of variance gave additional 
support for the significance of the main effects 
for heterogeneity and mean set value, but 
neither the main effect due to set size, nor 
that due to skewness, approached significance. 
Two interaction effects were found to be 
highly significant, however. First, the Series 
I-II analysis revealed that the difference in 
mean set judgments between the low and 
high MSV conditions is larger for 3-sets than 
for 2-sets, as shown in Table 4. This effect is 
significant beyond the .001 level. Not surpris- 
ingly, the effects of stimulus pooling were 
greater in the condition in which the larger 
number of stimulus components were pooled. 
It should be noted that the range of component 
scale values, both within high and low MSV 
conditions and overall, were very nearly the 
same for all three series, thus eliminating the 
possibility that differences in ranges could 
account for the obtained results. 

Second, the series-by-series analyses showed 
that both male and female /s tended to be 
more lenient towards their own sex (Table 5). 
This pattern of differences held for all three 
series, reaching the .01 level of significance 
in Series I and ITI, and the .05 level in Series 
II. The differential is a little more than twice 
as large for male Js as it is for female Js. 

Effects of number of judges per J on variance 
of judgments. In order to investigate the 
hypothesis that judgments made by partners 
exhibit less variance than those made by 
individual judges, mean variances were com- 
puted for the two J sizes. The mean variance 
for single judges was 37.97, while that for 
partners was 33.97. The difference, although in 
the expected direction, is not significant 
(¢ = 1.20, df = 118, p > .20). Pooling of 
responses did not produce a significant degree 
of convergence of judgments. Thus, even 
“the only consistent finding of significance for 
social psychology,” cited by Johnson (1955, 


’ For those sources of variation which were signif- 
icant at or beyond the .05 level for amy two of the 
three series, the null hypothesis was ‘fentatively rejected. 
See the original thesis (Willis, 1957) 
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TABLE 4 


SuBGROUP MEANS FoR INTERACTION DETWEEN MEAN 
VALUE AND Ser Size* 








Mean Scale Value 





Low 





2 2.10 
3 2.04 





* Based on Series I and II. 


TABLE 5 


Suscrourp MEANS FoR SEX INTERACTION BETWEEN 
Sex or Pxorocrapuic SusyectT AND SEX OF 
JupcEs* 








Sex of Photographs 





Male Female 





Male .48 
20 


3 
Female 3. 





* Based on all data. 


p. 471) in his discussion of group judgment, 
is not invariably found. 


DISCUSSION 


The most interesting results, from the point 
of view of social stereotyping processes, are 
(a) the highly significant enhancement of 
contrast, which resulted in an increased scale 
separation in the set-judging condition, and 
(6) the more pronounced nature of this effect 
with the sets having the larger number of 
components. These findings strongly suggest 
that, quite independently of learning and 
memory effects, there is a purely judgmental 
process that contributes to enhancement of 
contrast. 

The enhancement of contrast observed here, 
however, is of a different kind from that found 
by Campbell (1956). The judges in his experi- 
ment made judgments about group members, 
not about groups as such. He found that the 
amount of overlap between two groups was 
considerably underestimated, a result which 
he had predicted from considerations of 
stimulus generalization. Neither his prediction 
nor his result involved an increased separation 
between group means, such as was found in 
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the present experiment. In fact, his analysis 
calls for a decrease in the difference in group 
means due to intergroup stimulus generaliza- 
tion, a prediction which was very roughly 
confirmed by his data. Thus, although inter- 
group stimulus generalization may have 
played some role in the present experiment, 
its effects, if any, were more than neutralized 
by effects operating in the opposite direction 
on the differences in group means. 

What underlying mechanism, then, may 
account for this absolute exaggeration of group 
differences? One possibility is that it is the 
result of efforts on the part of Js to reduce 
the cognitive strain resulting from ambiguity 
by sharpening intergroup differences. It can 
be assumed that Js had more difficulty in 
forming definite impressions when judging 
sets than when judging individual components, 
and that this difficulty, or cognitive strain, 
was greater for 3-sets than for 2-sets. The 
intergroup sharpening, along with relative 
intragroup leveling, served to reduce the 
impression of ambiguity and the concomitant 
cognitive strain. We may hypothesize that a 
need to reduce cognitive strain may\also be 
one of the important motivations in the forma- 
tion of social stereotypes, as well as in other 
instances of sharpening, leveling, and enhance- 
ment of contrast occurring in social perception. 

A second possible mechanism, not mutually 
exclusive with the one above, is that Js were 
making judgments in terms of estimated 
probabilities, so that increasing distances in 
either direction from some central point on the 
response continuum corresponded to increasing 
degrees of unusualness. Then, in the set- 
judging condition, component probabilities 
were componded in some way to yield a lower 
probability for the set. This relatively low set 
probability would evoke a response assign‘ng 
the set to a position at relatively great distance 
from the central point on the response scale. 
To take an analogy from everyday experience, 
it would be judged somewhat unusual to see 
a woman six feet tall, but it would be judged 
more unusual to see two or three such women 
together. Unfortunately, the data at hand do 
not allow a choice between the two interpre- 
tations. 

Whatever the psychological mechanism or 
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mechanisms may be, it is easy to show what is 
necessary mathematically to remove the 
restriction of unit slope from Equations 7 and 
8, and so allow either for the possibility of 
sharpening (slope > unity) or leveling (slcpe 
< unity) of group differences. By assuming 
that a different rate-of-change constant 
applies for each group size, we replace Gullik- 
sen’s single differential equation (Equation 1) 
by a set of equations. For groups of Size 1 and 
2, we write 


dv,/dx, => ky [9] 


dtz/dx_e = ke [10] 


where subscripts identify group size, and » 
and # are, as before, subjective value and 
objective stimulus measure, respectively. 
Since x, equals ‘ -e sum of the two ™, Equa- 
tions 3-5 become 


(11) 
(12] 
[13] 


v1 = ky(xi — Xo) 


2 = hy(x; — Xo) 


045 = he(xi + 23 — Xo) 


By algebraic operations similar to those 
used to arrive at Equations 7 and 8, the 
following expressions are reached. 


Sig = Kilsy + 5;) + Ke [14] 


sy = Ky (at+) + K, [15] 


where Ky = ke/ hi, RK: = hex ~— (2he/ Ri ro 1)C, 
and K,’ = 44K>. Equation 14 applies when 
judges are requested to respond in terms of the 
sum of the component scale values (“How 
much would you like to have this portable pho- 
nograph plus this pen and pencil set?”), while 
Equation 15 applies when judges are requested 
to respond to the mean of the component 
scale values, as in the present experiment. 
When XK, > 1, group differences are exag- 
gerated, while when K, < 1, group differences 
are underestimated. For the data of Series I 
K, = 1.15, and K,’ = —0.37. For three com- 
ponents per composite stimulus, Equation 15 
may be generalized to read 
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[15] 


+ K, 


sun = K, (StU + 2) 


where Ky = ks/ky, Ke = 2RsxXo == (3h? / ky = 1)C, 
and K,’ = K,/3. For the data of Series I 
and [II taken together, K, = 1.34, and K,’ = 
—0.90. 

Another finding which may have implica- 
tions ior social stereotyping is the slight, but 
significant, drop in mean scale values produced 
by extreme heterogeneity of component scale 
values within sets. Because the effect is absent 
with moderate degrees of heterogeneity, 
predictions made on the basis of set means 
alone will be nearly as accurate as predictions 
taking into account individual component 
scale values; but because the effect appears 
with extreme component heterogeneity, knowl- 
edge of the distribution variance may, in 
extreme cases, contribute to predictive power. 
To see how this might apply to social stereo- 
typing, consider the example—entirely hy- 
pothetical—that it is known that Jews do not 
differ from other groups on mean introversion- 
extraversion, but are more widely dispersed 
about this mean, with higher proportions 
both of loud, aggressive individuals and of 
quiet, retiring ones. Stimulus pooling effects 
could conceivably contribute to a heavier 
weighting at one end of the continuum, and in 
this way facilitate the formation of a group 
reputation, even though the group in question 
does not differ in the mean from other groups. 

The greater leniency of Js towards the own- 
sex subseries is likely due largely to the fact 
that judges were rating the individuals por- 
trayed as potential dates or mates, and con- 
sequently adopted more critical standards. 
A second possibility is that, when judging 
members of one’s own sex, a process of group 
identification produced a tendency to judge 
less critically. Identification might also have 
caused judges to assign themselves a position 
on the scale, not necessarily consciously. Then. 
the more leniently a given judge judges his 
own-sex subseries, the higher a rating he 
implicitly assigns himself. 

In our culture, attractiveness is accepted 
by both sexes as being more important in the 
case of young women than it is for young men. 
This suggests that the date-rating set would be 
more pronounced for males judging females, 


but that group identification would be more 
salient for females judging females. Because 
the difference in mean set judgments is more 
than twice as large for male Js as it is for 
female Js, it seems probable that the first 
process is the more important one. 

Turning to the negative finding of principal 
interest, the failure to find a significantly 
smaller variance of judgments in the “part- 
ners” condition, the most probable explanation 
is the lack of any but arbitrary, ambiguous 
scale units. This situation allowed partners, 
as compared to individual judges, to shift to a 
different equivalence ratio between the stimu- 
lus scale and the response scale. Partners no 
doubt felt as much of an obligation as did 
individual judges to use more than a few of the 
response categories, and because of this they 
expanded the scale just about the required 
amount to compensate for the effect of re- 
sponse-pooling. Presumably, if the stimuli had 
been of a sort that more clearly defined scale 
units (if scale units had been inches or dollars, 
for example), and if the relationship of re- 
sponses to stimuli had been more clearly indi- 
cated (as with anchors), the reduction in 
variance would have been more pronounced. 
Experimentation to test this interpretation is 
currently in progress. 


SUMMARY 


The effects of stimulus pooling on judg- 
ments of personal attractiveness were investi- 
gated in an orthogonal factorial design, using 
as stimulus components identification type 
photographs which had been previously scaled 
on attractiveness. Composite stimuli were 
constructed from these components by mount- 
ing sets of two or three photographs on cards. 
These compesite stimuli were varied system- 
atically along several dimensions. Judgments 
of sets were then obtained under conditions 
similar to those under which components had 
been scaled. The most clear-cut and firmly 
demonstrated positive results were: 

1. Mean set judgments (MSJs) were not 
equal to mean scale values (MSVs) of set 
components—the additivity condition did not 
hold. Rather, for sets of high MSV, MSJ > 
MSV, while for sets of low MSV, MSJ < 


MSV. 
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2. This enhancement of contrast was signifi- 
cantly more pronounced for sets of three 
components than for sets of two. 

3. Extreme heterogeneity of component 
scale values slightly lowered MSJs; moderate 
heterogeneity had no significant effect. 

4. Judges were more lenient in judging the 
attractiveness of own-sex photographs than 
other-sex photographs. 

Negative results worthy of note were: 

1. Judges working in pairs produced MSJs 
which were not significantly different, either 
in mean or variance, from those made by 
judges working alone. 

2. The main effect due to set size was not 
significant. 

3. The main effect due to the amount of 
skewness of component scale values about set 
MSV was not significant. 

The implications of the findings for (a) 
social perception, especially social stereo- 
typing, and (6) group judgment were con- 
sidered. 
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ACHIEVEMENT MOTIVATION IN FANTASY AND BEHAVIOR! 


DONALD M. BROVERMAN, EDWARD J. JORDAN, JR.,.? anp LESLIE PHILLIPS 
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VIDENCE that motives can influence the 
E content of perception has led to at- 

tempts to measure motivation by means 
of projective fantasy. McClelland, a leader in 
this movement, has presented a technique of 
assessing such motives as need Achievement, 
need Affiliation, and need Power from re- 
sponses to modified versions of the TAT. 
Experimental work (McClelland, Atkinson, 
Clark, & Lowell, 1953; Atkinson, 1958) has 
indicated positive relationships between these 
fantasy measures and indices of the same mo- 
tives derived from overt behavior. Such re- 
sults have been taken as evidence for what 
might be termed a general-expression model 
of motivation, i.e., the view that motives are 
concurrently reflected in both fantasy and 
overt behavior. 

Recently, however, the validity of this for- 
mulation has been questioned. Lazarus, Baker, 
Broverman, and Mayer (1957) found that 
achievement fantasy decreased with increasing 
age and education. Application of a general- 
expression model to this result implies a paral- 
lel reduction in the behavioral expression of 
achievement motivation with increasing age 
and education. This implication appeared im- 
plausible to Lazarus et al. Hence, these authors 
proposed that fantasy might serve as an al- 
ternative or substitute channel for the ex- 
pression of achievement motivation when this 
motive is blocked from behavioral expression 
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in real life. The higher achievement fantasy in 
younger, less well educated Ss was attributed 
to their inability to express achievement needs 
in behavior, while the older individuals were 
seen as more able to express such needs, with 
a concomitant reduction of fantasy about 
achievement. 

This alternative-channels model leads to the 
prediction of an inverse relationship between 
achievement fantasy and behavioral attempts 
to achieve. It thus stands opposed to the gen- 
eral-expression model of McClelland et al. 
(1953) which predicts a positive relationship 
between these two indices of achievement mo- 
tivation. 

The present study is concerned with an ex- 
perimental resolution of these conflicting 


models. In attempting such a resolution, it 
seemed to us that the kind of behavioral index 
of motivation to be used was of critical con- 
sequence. Most previous experimentation has 


used for this purpose performances on labora- 
tory experimental tasks, but in the light of a 
recent discussion of “task relevance’ by Vogel, 
Baker, and Lazarus (1958) we felt that the 
assessment of achievement motivation in be- 
havior by such means was of questionable 
validity. Laboratory tasks such as solving 
anagrams, solving arithmetic problems, or 
crossing out letters on a page would appear to 
have little relevance to an individual’s princi- 
pal interests in life, and might well yield data 
more influenced by such extraneous factors as 
the individual’s proneness to believe laboratory 
instructions or his need to please the examiner. 
Furthermore, this technique subjects all Ss 
to the same task, disregarding individual differ- 
ences in specific areas of interest. Even if some 
Ss were highly motivated by a laboratory 
task of arithmetic problems, for example, 
many others probably would not be. Finally, 
many experiments have intentionally used 
tasks that are relatively simple to perform, 
so that intelligence would not confound pre- 
sumed effects of achievement motivation in 
performance. However, evidence (Lewin, 
Dembo, Festinger, & Sears, 1944) suggests 
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that striving declines with relatively easy 
tasks, paralleling a decrease in level of aspira- 
tion. 

The preceding considerations impelled us to 
measure the behavioral expression of achieve- 
ment motivation by means of life history data, 
which appear much more likely to reflect be- 
haviors toward goals that are important to the 
individual. Life history data are unique to each 
individual, obviating the possibility that an 
irrelevant task is being imposed on S. Further- 
more, their use allows the individual to report 
on his attempts to achieve goals at a level of 
difficulty suitable to his achievement motiva- 
tion. 

The present study investigates the relation- 
ship between achievement fantasy and be- 
havioral indices of achievement motivation 
taken from the area of employment, a central 
concern for our Ss, who were adult males. A 
positive relationship between achievement mo- 
tivation in fantasy and behavior would be 
taken as supporting the general expression 
viewpoint on this motive, while a negative re- 
lationship would support the alternative-chan- 
nels formulation. In either event, the use of a 
behavioral index drawn from life rather than 
laboratory behavior was seen as lending greater 
weight to any evidence that might be obtained. 


METHOD 

Subjects 

Thirty-seven members of a local church were used 
in this study. All Ss were paid for their participation, 
but their pay went to the support of their parish. The 
sample was characterized by a wide range with respect 
to socioeconomic variables. Education varied from 5 
to 19 years, with the median at 13 years. Annual income 
extended from $3,000 to $10,000, with a median of 
$5,000. Job level, as defined by the Dictionary of 
Occupational Titles (DOT), ranged from managerial 
and professional to semiskilled, with a roughly even 
distribution of Ss in all job categories. Age ranged 
from 24 to 50 years, with a median of 36. 


Procedures 


Ss were administered the Worcester Scale of Social 
Attainment (WSSA) in a group. In a later session 
individual inquiry into responses was conducted. The 
WSSA is a broad assessment of the degree to which an 
individual meets certain standards of accomplishment 
in society, and is discussed more fully by Phillips and 
Cowitz (1953). Two sections of this scale were selected 
for use in this study: Striving for Advancement and 
Job Level. 

Striving for Advancement (WSSA, Item 7) taps the 
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extent to which an individual actively seeks to better 
his employment position. Scoring on this item is based 
on answers to the following check list. 

1. I would like to go to school to get a better job. 

(a) Do you have a particular school in mind? 
(6) Name of school? 
. I have visited plants looking for a better job. 
. I am interested in another job if it is offered. 
. I read the help-wanted ads every day. 
. I have asked my friends to look for jobs for me. 
. Iam going, or have gone to school lately for job 
improvement. 
. Sometimes I think about looking for something 
better. 

8. I have written letters to employers. 

9. I am keeping my eyes open. 

10. I have tried to get interviews. 

11. I would like to have a better job. 

12. I would like to learn a trade. 

13. I expect to work for myself. 

14. I would like to work for myself. 

Responses to these questions permit classification 
of Ss into following four categories: 

1. Desire and intense activity, e.g., letters written 
about jobs; interviews applied for; contacts for 
better position actively sought. 

. Desire and tentative activity, e.g., is interested in 
another job if offered; does not seek a job; does 
not write although may look through ads; has 
vague long range goals. 

3. Desire but no activity, e.g., says he would like a 
change but does nothing. 

4. No desire and no activity. 

Ss were placed in one of these four categories by two 
independent judges, whose judgments correlated .98 
(Pearson rank order correlation). 

Behavioral manifestation of the achievement motive 
is typically understood as a kind of striving measure, or 
a measure of how much effort is involved in trying to 
achieve; it does not necessarily reflect actual achieve- 
ment. At the same time, however, it seems possible 
that actual achievement may influence the amount of 
achievement fantasy produced by an individual. For 
this reason, S’s occupational level was used as a control 
for actual achievement in the area in which his be- 
havioral expression of achievement motivation was 
examined. This control measure was taken from the 
Job Level item of the WSSA (Item 1). This item assigns 
scores to each level of occupation, as defined by the 
DOT. Higher scores are assigned to higher job levels. 

The fantasy measure of achievement motivation 
was taken from the Role-Taking Task (RTT) devised 
by Feffer (1959). This is a projective test, based on 
Shneidman’s Make-a-Picture-Story (MAPS). Three 
background settings were used: a factory scene, a 
living room scene, and one other chosen by S from the 
MAPS materials. A variety of figures (men, women, 
and children) were provided for S to place in these 
scenes. Ss then told a story, using at least three figures, 
for each situation. After completion of the three stories, 
Ss were presented with each situation again and in- 
structed to retell their stories from the viewpoint of 
each character in the scene. 

RTT stories are scored for motivational content by 
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several criteria. Four of these were used in the present 
study, since they were McClelland’s need Achievement 
categories of Achievement Imagery (AI), Stated Need 
for Achievement (nAch), Anticipatory Goal States 
(GA+, GA—), and Affective States (G+, G—). Ina 
sample of 120 TAT stories presented and scored by 
McClelland et al., this portion of the need Achievement 
score correlated .95 with total need Achievement. 
Hence, it seems reasonable to assume that scores 
based on these four categories of need Achievement are 
highly representative of total need Achievement, as 
scored by McClelland et al. (1953). Each S’s achieve- 
ment fantasy score is the sum of themes in his RTT 
stories which satisfy the criteria of any of the above 
four classes. 


RESULTS 


Achievement fantasy scores were analyzed 
in a two-by-two factorial analysis of variance. 
Independent variables used were: Striving 
for Advancement (WSSA, Item 7) and Job 
Level (WSSA, Item i). Ss in the two higher 
striving categories were compared with those 
in the two lower categories. Job Level was 
divided at the median into high and low 
groups. The requirement of proportionality 
between the four cells (Lindquist, 1953) neces- 
sitated the random elimination of three Ss from 
the appropriate cells. The resultant N of this 
analysis is 34. Correlation between Striving 
and Job Level was negligible (rf = .11). 

Table 1 reveals that Striving is reliably asso- 
ciated with fantasy-expression of achieve- 
ment motivation (F = 4.35, p less than .05). 
Table 2 indicates that the direction of this 
relationship is inverse: high strivers show low 


TABLE 1 


ANALYSIS OF VARIANCE OF 
ACHIEVEMENT FANTASY 


(VN = 34) 








Source df MS P 





Striving 4.36* 
wee Level 
nteraction 


1 
1 
1 
Within Error 30 
Total 33 


79.53 
4.63 
4.99 

18.22 


SS = 635.88 





* F significant beyond the .05 level, two-tailed test. 


TABLE 2 


MEAN ACHIEVEMENT-FANTASY SCORES OF 
Hics anp Low Srrivinc Groups 








4.11 


High Striving (n = 17) 
i 6.47 


Low Striving (# = 17) 
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achievement fantasy, and low strivers show 
high achievement fantasy. 

Table 1 further indicates that Job Level and 
fantasy-expression of need Achievement are 
not significantly related.* 


DISCUSSION 


This study demonstrates an inverse rela- 
tionship between behavioral striving and 
achievement fantasy,‘ and thus supports an 
alternative-channels view of the expression of 
achievement motivation. Interpretation of 
these results must take account of the nu- 
merous studies which support the general- 
expression model by demonstrating positive 
relationships between achievement fantasy and 
striving. The alternative-channels view can no 
more account for these instances of positive 
relationships than the general-expression model 
can account for the present results. We are in- 
clined to attempt a resolution of this conflict of 
results via an extension of the alternative- 
channels model. 

We would approach this problem by sug- 
gesting that persuns unable to express their 
achievement needs in life-situations will be 
impelled to seek alternative modes of need ex- 
pression, not only in fantasy, as suggested by 
Lazarus et al. (1957), but also in behavioral 
striving when tasks simpler and less ambiguous 
than life problems are encountered. 

The experimental tasks used to assess be- 
havioral striving typically are safer and/or 
better structured than the situations in which 
most life goals are embedded. Also, because 
they are made easy, intelligence presumably 
plays little, if any, part in achievement on 
them. Therefore, persons who are blocked in 
expressing achievement motivation in real 


* Two other possible indices of achievement motiva- 
tion, educational level and annual income, also failed 
to predict achievement fantasy. While it is impossible 
to draw general conclusions from these findings, it 
appears that actual achieverrent was not a predictor of 
achievement fantasy in the present sample. 

* Vogel, Baker, and Lazarus (1958) also report an 
inverse relationship between achievement fantasy and 
a behavioral index of achievement motivation (per- 
formance on the McKinney Reporting Task). However, 
this relationship was found only with fantasy indices 
obtained after a failure-stress condition; no relationship 
was found with fantasy obtained under neutral condi- 
tions. Hence, the relevance of this finding to the 
theoretical issue discussed in this paper is unclear. 
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life might be expected to show a great deal of 
striving on such tasks. On the other hand, Ss 
who are able to express their achievement 
needs in life should have no need to exert them- 
selves on tasks so distant from real-life con- 
cerns. Such persons might be expected to show 
little striving in the laboratory. It then fol- 
lows from the present finding, i.e., a negative 
relationship between life striving and achieve- 
ment fantasy, that low laboratory strivers 
would be the low achievement fantasiers, while 
high laboratory strivers would be the high 
achievement fantasiers. Thus, a reversal of 
striving patterns from life to the laboratory 
may account for the positive relationships pre- 
viously observed between striving and achieve- 
ment fantasy. Such a reversal occurs, according 
to this argument, because performance of lab- 
oratory tasks, like fantasy, may serve as a 
substitute expression of blocked achievement 
motivation. 

The extended alternative-channels model, 
then, seems preferable to the general expression 
model for several reasons. It can account for 
both positive and negative relationships be- 
tween striving and achievement fantasy and it 
is supported by evidence drawn from life 
history data. Finally, this model is generally 
consonant with many current developments 
in psychological theory: the psychoanalytic 
emphasis upon thought and fantasy as con- 
sequences of delayed action (Rapaport, 1951); 
evidence presented by the Lewinian school for 
substitute activity and cosatiation (Lewin, 
1935); and demonstrations of the vicarious 
relationship of fantasy to action by develop- 
mentally oriented studies (Krus, Werner, & 
Wapner, 1953; Hurwitz, 1954; Kruger, 1954; 
Misch, 1954). 

Nevertheless, it is clear that any attempt at 
a definitive statement about the relationship 
between achievement fantasy and striving 
would be premature at present. The extension 
of the alternative-channels model advanced 
in the present paper has not yet been empir- 
ically tested. Furthermore, additional factors 
such as cognitive controls (Klein, 1954; 
Lazarus et al., 1957) and environmental con- 
ditions (Birney, 1959), may influence the kind 
of relationships that are obtained. 

One such factor, particularly pertinent to 
this study, is variation in fantasy-assessment 
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technique. The RTT, on which our results are 
based, differs from the usual avisievement- 
oriented TAT by allowing S some choice in 
determining the test stimuli and by requiring 
the retelling of stories from several viewpoints. 
While it is difficult to envision how these 
differences might reverse the direction of rela- 
tionship between striving and fantasied 
achievement, this possibility cannot be ruled 
out, and our findings must be qualified in this 
regard. 

In any event, the present study casts doubt 
on the unqualified assumption that achieve- 
ment motivation is generally expressed and 
supports the conceptualization of fantasy and 
behavior as alternative channels for the ex- 
pression of this motive. 


SUMMARY 


This study investigates the relationship of 
achievement fantasy to behavioral indices 
of achievement motivation. Conflicting views 
of this relationship have been presented. 
McClelland et al. (1953) have assumed that 
fantasy is a direct index of a generally ex- 
pressed motive state and have supported their 
position with findings of positive relationships 
between fantasy and behavioral indices of 
achievement motivation. Lazarus et al. (1957) 
on the other hand, have conceptualized fantasy 
as an alternative, substitute channel for the 
expression of achievement motivation when 
behavioral channels are blocked. This latter 
view implies an inverse relationship between 
fantasy and behavioral expression of motiva- 
tion. 

The present experiment is an attempt to 
determine which of these views is applicable 
when behavioral indices of achievement mo- 
tivation are derived from life history data. 

Thirty-seven adult males, representing a 
wide range of age and socioeconomic variables, 
were administered the Worcester Scale of 
Social Attainment (WSSA) and a Role-Taking 
Task (RTT). Two behavioral indices of 
achievement motivation were taken from the 
WSSA: Job Level, a measure of actual achieve- 
ment, and Striving for Advancement, a meas- 
ure of the degree to which an individual is 
actively trying to raise his level of occupational 
achievement. A fantasy measure of achieve- 
ment motivation was derived from the RTT. 
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An inverse relationship was found between 
the striving and fantasy measures of achieve- 
ment motivation. No significant relationship 
was found between actual achievement and 
either achievement fantasy or striving. 

These results were interpreted as supporting 
the alternative-channels model of Lazarus et 
al. (1957). Previous research in this area was 
criticized, and an attempt was made to extend 
the alternative-channels model to cover the 
reported instances of positive relationships be- 
tween achievement fantasy and behavioral 
expression of motivation. 
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HE theory of cognitive dissonance 
(Festinger, 1957) has led to empiri- 
cal investigation of the psychological 
consequences of inconsistencies between atti- 
tudes and behavior. It has been shown, for 
example, that a person induced to argue against 
his private opinion will tend to show change in 
his private opinion toward the induced po- 
sition (Cohen, Brehm, & Fleming, 1958; 
Festinger & Carlsmith, 1959). It has also been 
shown that simply engaging in a behavior in- 
consistent with one’s attitudes results in 
attitude change toward consistency with the 
behavior (Aronson & Mills, 1959; Mills, 1958). 
Other derivations concerning the consequences 
of inconsistency between attitude and behavior 
have yet to be tested. This paper reports a test 
of a relatively “nonobvious” derivation. 

According to the theory, when a person is 
induced to engage in some behavior that he 
ordinarily would avoid, he experiences dis- 
sonance. He then tries to reduce the dissonance, 
and, in the present case, one way he may do so 
is by changing his attitude toward the be- 
havior (or some aspect of it) in such a way as 
to reduce the reasons for avoiding it. Thus, an 
individual induced to perform a disliked be- 
havior should tend, among other things, to 
increase his liking for that behavior. 

The person’s tendency to change his attitude 
is conceived to be a direct function of the 
magnitude of dissonance experienced. Given 
his dislike of a behavior and with the inducing 
force held constant, the magnitude of dis- 
sonance increases as the amount of committed 
behavior increases. And given the fact that he 
has committed himself to some disliked be- 
havior, the less support that further informa- 
tion to which he is exposed gives to that be- 
havior, the greater the magnitude of dis- 
sonance. 

Specifically, then, the following derivations 
may be made. 

1 This project was conducted as part of the Yale 
Communications Research Program. It was supported 
by a grant to Carl I. Hovland from Bell Telephone 
Laboratories. 
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1. The greater the amount of disliked be- 
havior to which a person commits himself, the 
greater the magnitude of dissonance and conse- 
quent increase in liking for the behavior or the 
behavioral object. This hypothesis has been 
tested by Aronson and Mills (1959), who 
showed that the more severe a person’s initi- 
ation in joining a group, the greater his liking 
for the group members and their activities. 

2. Given commitment to disliked behavior, 
the less supportive is additional information, 
the greater the magnitude of dissonance and 
consequent increase in liking. This effect is 
expected to increase as a function of the amount 
of committed behavior. The present study was 
designed to test this derivation. 

The theory also asserts that a person ex- 
periencing dissonance tries to reduce it and to 
avoid increasing it. Thus, the extent to which 
information either for or against engaging in 
the behavior is believed should be a function of 
the amount of committed behavior. This con- 
sideration gives rise to another derivation 
tested by this study. 

3. Supporting information is accepted and 
nonsupporting information is rejected by a 
person in proportion to the amount of disliked 
behavior to which he is committed. 


METHOD 


The general procedure has been described in a 
previous report (Brehm, 1959). Briefly, it consisted of 
(a) a pre-experimental measure to ascertain a vegetable 
disliked by each of the subjects (Ss), (6) an individual 
experimental session in which each S was induced to 
commit himself to eating some of the disliked vegetable, 
(c) exposure of the S, after he was committed, to in- 
formation in support of or against eating the disliked 
vegetable, and (d) measurement of relevant affect and 
opinions before and after the experimental inductions. 

The pre-experimental questionnaire was given to 
eighth-grade students in their English classes.* It con- 
sisted of a scale for each of 34 vegetables. S 
was to indicate how much he liked each vegetable by 
an X on any one of 61 dots running along a straight 


* For their help in conducting this study, the author 
wishes to thank Louisa Willard, Psychologist for the 
New Haven Schools, Frank Carr, Principal of Troupe 
Junior High School, Teacher Connolly, and the eighth 
grade teachers, especially Teacher Himmelfarb. 
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line and identified at intervals of 10 dots by the follow- 
ing labels: like very much; like a lot; like a little; 
neither; dislike a little; dislike a lot; dislike very much. 
Ss’ responses were then scored from +3.0 for “like 
very much” to —3.0 for “dislike very much.” 

Experimental procedure. Each S was assigned, during 
school hours, to go to the Homemaking room of the 
school where he was met by the experimenter (Z). It 
was explained that the project was “consumer research” 
about various people’s preferences for different kinds of 
vegetables. E said that he wanted to ask the S to answer 
some questions about a vegetable picked at random 
(though it really had been selected as one quite dis- 
liked, at least —2.0 on the pre-experimental question- 
naire). After answering some preliminary questions, S 
was given the main questionnaire to fill out. The ques- 
tions, to be described in detail, were designed to 
measure liking for the vegetable and opinions about 
its food value. 

On completion of the main questionnaire, S was told 
that his answers would be very helpful but that a person 
sometimes gave different answers if he first tasted the 
vegetable. It was further explained that S need not do 
anything further, since he might not like the idea of 
eating some of the vegetable, but that he would be 
given a reward if he chose to eat the vegetable and 
answer further questions about it. At this point the 
experimental manipulation of amount of committed 
eating was introduced. 

Commitment to little eating (Low Eating condition) 
was accomplished by telling S that to obtain the reward 
he must eat a single small dish of the vegetable imme- 
diately. Commitment to greater eating (High Eating 
condition) was accomplished by telling S that to obtain 
the reward he must eat a small dish of the vegetable 
immediately and also return three or four times within 
a few weeks, each time to eat more. All experimental 
Ss were offered the same reward, a gift certificate good 
for their own choice of two 45 rpm or 78 rpm records, 
or two tickets for movies of their own choice. Ss were 
assigned at random to the two commitment conditions. 
It should be noted that the overall rate of refusal to eat 
was only 5% and did not differ between conditions. 

If S agreed to continue, he was given a small dish 
of the vegetable, heated if appropriate, and allowed a 
few minutes in which to eat it. When finished, the S was 
given a “research report that he might be interested 
in reading.” It was ostensibly from the laboratories of 
the testing organization and purported to give the food 
vaiue of the vegetable. Actually, each report was 
designed to give information in support of (Support 
condition) or against (No Support condition) eating 
the vegetable. The following report, for example, was 
designed to support the eating of spinach. 

Vegetables have two important values in a per- 
son’s diet—they furnish most of the minerals and 
most of the vitamins needed for strong bodies and 
active minds. 

Our tests of spinach show that this vegetable is 
particularly strong on vitamins. An average size 
portion will furnish 95% of the daily quantity of 
vitamins needed for building new body tissues and 
mending injured tissues and bones. Spinach is prob- 
ably the best single vegetable to supply vitamins. 
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Other reports in the Support condition were identical 
except for the name of the vegetable. Reports in the 
No Support condition were identical except that they 
said “weak” instead of “strong,” “only 5%” instead 
of “95%,” and “worst” instead of “best.’’* The type of 
report given to each S was determined by chance. 

When S had finished reading the report he was asked 
to fill out again the main experimental questionnaire 
(identical to the “before” questionnaire). In making 
the request, Z mentioned that some people change one 
way, some another, and some don’t change at all. He 
also reaffirmed the anonymity of the S’s answers. 

Upon completion of the questionnaire, S was allowed 
to choose his reward and was asked not to talk about 
what he had done when he returned to class. Shortly 
after the completion of the project, Z met with all the 
Ss in a group and explained the deceptions and true 
purpose of the project. 

Control procedure. In order to estimate any direct 
effects of the communications without a commitment to 
eat, a classroom experiment was performed. For this 
test 8th graders from a different school had to be used.‘ 
The second school is not far from the first and draws 
its students from a population with similar socioeco- 
nomic status. The same preliminary questionnaire was 
given, though this time by the Z and only one week 
prior to the experimental session. A set of materials 
was prepared for each S, and consisted of the main 
questionnaire under which was a sealed booklet made 
up of either a Support or No Support communication 
for a vegetable disliked by the S, and a second copy of 
the main questionnaire. These were passed out to the 
students in class and E explained he was trying to 
obtain more detailed information by asking each person 
several questions about one particular vegetable whose 
name he would find on his questionnaire. Ss were asked 
first to tear off their names, which were on separate top 
sheets. The questionnaires, thus, looked anonymous 
although they were coded on the back. Z then asked Ss 
to fill out the first questionnaire carefully but not to 
unseal the booklet until told to do so. When the first 
questionnaires were completed and passed in, EZ ex- 
plained that while this information would be good, he 
had also found that people sometimes gave different 
answers if they thought carefuily about the vegetable 
first, trying to imagine as vividly as possible just how 
it tastes, smells, and feels in the mouth. So, he said, he 
was also asking them to do this and then to answer some 
further questions they would find on the inside of the 
booklet. Also, on the inside of the booklet they would 
find a laboratory report about their vegetable in which 
he thought they would be interested, and they should 
read that before answering the questionnaire. After all 





* About half of the Ss received similar communica- 
tions concerning mineral instead of vitamin content. 
Since this type of communication failed to affect liking 
either directly or in conjunction with commitment, the 
results for these subjects are omitted from this report. 

*The author wishes to thank Louisa Willard, 
Principal Corrigan, Principal of Truman School, and 
Teachers Walker, Smernoff, and Kelly, at Truman 
School, for their help in conducting this part of the 
study. 
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final questionnaires had been completed and turned in, 
the true purpose of the experiment was explained. 

The main experimental questionnaire. This question- 
naire included one question identical to that used on the 
prequestionnaire, which read, “Considering everything 
about it, (taste, smell, appearance, texture) to what 
extent do you like or dislike this vegetable?” It was 
answered on a scale like that used on the prequestion- 
naire. A second question pertained to the food value of 
the vegetable, and read, “How high would you guess 
the general vitamin content of this vegetable compared 
to others?” The answering scale was similar to that 
used for the previous questions except that the identi- 
fying labels were: very high; fairly high; slightly above 
average; just average; slightly below average; fairly 
low; very low. 


RESULTS 


Commiiment and change in belief. While the 
content of the communications was direct and 
forceful and the source was highly authorita- 
tive, Ss might still manage to believe them 
differentially. According to dissonance theory, 
the greater the magnitude of dissonance being 
experienced, the greater the seeking of informa- 
tion that can reduce dissonance and the avoid- 
ance of information that might, increase it. 
Thus, persons in the High Eating condition, 
more than those in the Low Eating or Control 
conditions, should tend to raise, or resist 
lowering, their estimate of vitamin content. 

Mean changes in estimated vitamin content 
are given in Table 1. The means for all three 
conditions show a decrease in the No Support 
and an increase in the Support condition, the 
difference being statistically reliable for each. 
It is clear that the manipulation of Support and 
No Support was successful. Mean changes in 
the High Eating condition, however, show less 
reduction in estimated vitamin content from 


TABLE 1 


MEAN InrTIAL ScoRES AND CHANGES IN 
EstmmaTep VITAMIN CONTENT 








Communication 





Commitment Support 


| Initial| Change (N) Change (N) 


No Support 





Initial 

.58*| —3.02 (11)| .84 | 0.92 (10) 
Low Eating | .45| —2.35 (8) | 1.76 0.77 (7) 
High Eating | 1.23| —0.34 (7) | .46 | 1.66 (10) 














* Comparisons of the initial means do not yield significant és, 
indicating that the initial scores are somewhat unstable witr Ns 
this small. 
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the No Support communication, significantly 
different from the Low Eating condition at the 
.07 level, and from the Control condition at 
beyond the .01 level.® Similarly, the increase in 
estimate from the Support communication is 
greater in the High Eating condition than in 
either Low Eating or Control conditions, 
though not significantly so. If the effects of the 
Support and No Support communications are 
averaged, the High Eating condition is signifi- 
cantly different from the Low at the .04 level, 
and from the Control at beyond the .01 level. 
It is apparent, then, that Ss tended to believe 
supporting information and disbelieve non- 
supporting information in proportion to the 
amount of eating to which they were com- 
mitted. 

Effect of support on liking. The communica- 
tions were given in a class room situation to 
estimate their effect on liking for the vegetable. 
It was assumed that any such effects obtained 
in the Control condition would be at least as 
large in magnitude for the individually tested 
Ss is the main experimental conditions. 

Mean changes in liking (from the “before” 
to the “after” main experimental question- 
naire) were computed for the Control con- 
ditions. These changes, shown in the first row 
of Table 2, indicate that the Support com- 
munication produced increased liking, the No 
Support decreased liking, the difference be- 
tween the conditions being significant at the 
05 level. 

Effect of commitment and support on liking. 
Since the Control condition showed communi- 
cation support to increase liking while the dis- 
sonance hypothesis holds that mensupport in- 
creases liking, the two effects are working 
against each other. Thus, in the Low Eating 
condition, where the magnitude of dissonance 
is low, we might expect the support manipula- 
tion to have a direct effect on liking, similar to 
that in the Control condition. But in the High 
Eating condition, where the magnitude of dis- 
sonance is greater, the effect of attempts to 
reduce dissonance should reduce or even re- 
verse the direct effect of the support manipula- 
tion on liking. 

Mean liking change scores for each experi- 
mental condition are presented, along with the 
Control data, in Table 2. The direct effect of 


5 All » values are from two-tailed ¢ tests. 
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TABLE 2 
Mean InrTIAL LIKING AND CHANGES FOR 
EACH SITUATION 


; 


Commitment 











Initial Change 


—1.56] . 
— .66 | 1.97 
6 | -48 


| —.s2e | —.43e 
}-.99 | ii 
| — .03 1.33 


Control 
Low Eating 
High Eating 
* None of the below comparisons yields a significant é for initial 
scores. 
> » values for various comparisons of change scores: 
Control (No Support-Support) = .0S 
Low Eating (No Support-Support) = .06 
High Eating (No Support-Support) = .09 
No Support (High Eating-Control) = .02 
No Support (High Eating-Low Eating) = NS 
Support (Low Eating-Control) = <.01 
Support (Low Eating-High Eating) = <.C1 
Low Eating (No Support-Support)—High Eating (No Support- 
Support) = .02 
Control (No Support-Support)-High Eating (No Support- 
Support) = .03 








the support manipulation occurs in the Low 
Eating condition: increase in liking is greater 
for the Support than for the No Support com- 
munication (significant at the .06 level). And 
as expected from the dissonance hypothesis, 
the difference between Support and No Support 
conditions with the High Eating manipulation 
is reversed. While the difference is significant 
at only the .09 level, it is significantly different 
from that for the Low Eating condition at the 
.02 level, and from the Control condition at 
the .03 level. It is clear, therefore, that the 
direct effect of supporting information on in- 
creased liking tends to be reversed to the extent 
that there is behavioral commitment. 

It will also be noted in Table 2 that com- 
mitment has no over-all effect on liking. The 
Low Eating conditions produced as much in- 
creased liking on the average as did the High 
Eating. These results therefore fail to provide 
adequate support for the hypothesis that in- 
creased liking is a simple function of amount of 
eating committed. 

DISCUSSION 

This experiment has shown that a person 
induced to perform a disliked behavior tends 
to increase his liking for the behavior as a 
joint function of the amount of the behavior 
to which he is committed and the presence of 
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further supporting or nonsupporting informa- 
tion. The major results are in support of 
Festinger’s theory of cognitive dissonance. 
When a person is induced to perform a disliked 
behavior, additional information can decrease 
or increase the magnitude of dissonance by 
supplying information that is respectively for 
or against engaging in the behavior. He at- 
tempts to reduce the dissonance, in proportion 
to its magnitude, by increasing his liking for 
the behavior. Thus, he increases his liking for a 
behavior to which he is committed to the extent 
that he is exposed to information against en- 
gaging in the behavior. 

While the predicted interaction was ob- 
tained, the manipulation of amount of com- 
mitted eating failed to produce an over-all 
effect on liking. Average change in liking in the 
Low Eating condition was as great as in the 
High. One possible reason for this unexpected 
result is that Ss in the High Eating condition, 
who were expecting to return again, were more 
restrained in their general reaction to the situ- 
ation. Another possibility is that the Eating 
manipulation was not sufficiently strong to 
provide an overall effect; only in conjunction 
with the No Support communication did it pro- 
duce dissonance. The latter interpretation, 
however, would not explain why with the 
Support communication the Low Eating con- 
dition produced greater increased liking than 
did the High Eating condition. A third possible 
interpretation is that the pressure to change 
liking is a function of the discrepancy between 
behavior and information. That is, where be- 
havior is minimal and informational support 
high, or behavior is great and support low, 
there is pressure to increase liking. When, 
however, both behavior and support are great, 
the pressure to increase liking is low. Thus, it is 
suggested that dissonance may be created by 
exposure to supporting information, and may 
be reduced interchangeably by commitment 
to behavior or by increased liking. This im- 
plication of the theory is not developed in 
Festinger’s statement (1957). 

The experiment, it shouid be noted, was con- 
cerned with only one possible channel of dis- 
sonance reduction. It assumed that attitude 
change toward the induced behavior would be a 
major mode of reduction under the present 
conditions, though other modes were available 
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to subjects and may have been used to some 
degree. While this assumption was supported 
by the positive findings of the study, it might 
not prove to be a good assumption in other 
settings. As yet, dissonance theory does not 
completely specify the factors controlling the 
choice of channel through which dissonance is 
reduced. Making the theory operational there- 
fore requires taking such extratheoretical 
factors into account. There is here a clear need 
for theoretical refinement and further research. 
Meanwhile, the results of the present experi- 
ment can be generalized to other situations 
only with due caution. 


SUMMARY 


This study tested a “nonobvious” hypothe- 
sis derived from Festinger’s theory of cognitive 
dissonance: given the fact that a person is com- 
mitted to an unpleasant behavior, he tends to 
increase his liking for that behavior more if he 
is exposed to information against engaging in 
it than if he is exposed to information favorable 
to engaging in it. This tendency is anticipated 
to be in proportion to the amount of such be- 
havior to which he is committed. 

The hypothesis was tested by inducing 


junior high school students to commit them- 
selves to eating different amounts of disliked 
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vegetables. Fictitious research reports to which 
they were exposed said their vegetable was 
either high or low in food value. Estimates of 
food value and liking for the vegetable were 
measured before and after the experimental 
inductions. 

It was found that the communications about 
food value tended to be believed or disbelieved 
as a function of the amount of eating to which 
the person was committed. In addition, the 
main experimental hypothesis, as stated above, 
was supported. 
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in the realm of verbal behavior (e.g., 
Hildum & Brown, 1956; Verplanck, 1955; see 
Krasner, 1958, for review). Stimuli demon- 
strated to be reinforcing have been verbaliza- 
tions of assent, agreement and approval, 
nods, smiles, paraphrases and impersonal 
events such as buzzer and bell tones and colored 
lights. Attention, although hypothesized by 
Skinner to be a generalized reinforcer (Skinner, 
1953, 1957) has not been specifically investi- 
gated. 

The purposes of this study were (a) to deter- 
mine whether a combination of attention and 
assent constituted a reinforcing event, (5) to 
explore the effects of prolonged extinction on 
the occurrence of self-reference statements and 
other behavior such as emotional reactions an: 
escape behavior, and (c) to determine what are 
the implications of the use of common events 
(attention-assent) in free, unstructured inter- 
view situations. In most previous studies ex- 
tinction has not been carried out at length. In 
Verplanck’s investigation, for example, ex- 
tinction was carried out for only 10 minutes, a 
period too brief to uncover any effects other 
than those upon the class of operants con- 
sidered (Verplanck, 1957). 


ECENT experiments have demonstrated 
applications of operant conditioning 


METHOD 
General Experimental Plan 


The experiment was conducted under the guise of 
being a survey of student opinion with two groups of 
31 Ss, one experimental group and one control group. 
The survey interview was divided into four consecutive 
periods, the first three each lasting 10 min., while the 
last varied in length up to a maximum of 20 min. Each 
period was further subdivided into 2-min. intervals 
during each of which S’s self-reference statements and 
other statements were counted. After the introduction 
of the “survey” and after S understood what was 
wanted of him, the first 10-min. period began and the 
operant level of statements containing self-references 





1 The assistance of Beverly A. Brown, George E. 
Dunn, Ben V. Hole, Frank B. Krasne, Lillian Lauer, 
Jack Lyle, and Aline Spivock is gratefully ac- 
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was determined, no reinforcement being given in either 
group. 

During the second 10-min. period Z reinforced all 
statements containing self-references made by experi- 
mental Ss by following their occurrence by a combina- 
tion of attention and assent. A precise description of 
reinforcing events is given below. Control Ss received 
no reinforcement in Period II, nor in any other period. 

Extinction was conducted during the third 10-min. 
period and during the subsequent 20-min. period. As 
will be seen, some Ss stopped responding entirely during 
the course of extinction. Extinction consisted of with- 
holding from experimental Ss the reinforcement pre- 
viously given in Period II. 

If at any time during the course of the experiment S’s 
over-all rate of responding fell to zero, nondirective 
probes such as “What else can you think of?”, “Can 
you think of anything else?”’, or “Do you have any 
other ideas?” were used. These were infrequently neces- 
sary except during extinction and then they were often 
unsuccessful in maintaining continued responding. 


Subjects 


The experimental] Ss were 31 male and female, grad- 
uate and undergraduate students at Stanford Univer- 
sity, who were “interviewed” in the spring of 1958. 
Control Ss were drawn from the same population but 
were “interviewed” in the spring of 1959. It was ascer- 
tained at the conclusion of the experiment, when the 
real purpose of the “interview” was explained, that all 
Ss were naive to the extent that they had no training in 
conditioning and had never before participated in an 
experiment on operant conditioning. 


Experimenters 

The Es were advanced undergraduate and graduate 
students in two of the senior author’s courses on lan- 
guage and verbal behavior. Seven of these collected the 
experimental group data; two collected the control 
group data. One E collected data from both groups. In 
addition to having some theoretical sophistication in 
verbal operant conditioning, each Z had a minimum of 
four complete practice sessions followed by a detailed 
critique period. The Z who collected data from both 
groups had, in addition, one year’s intensive practice 
in verbal conditioning in connection with a number of 
other studies. He acquired considerable sensitivity in 
identifying statements and self-reference statements 
and imparted this skill to the other Z who collected 
control data and for whose training he was responsible. 
As a consequence, to which reference will be made 
below, rates of response show an elevation in the control 
group data as compared to the experimental group data. 

In view of the fact that there is a difference in the 
sensitivity of the experimental and control group Zs, it 
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would have been desirable to obtain interexperimenter 
reliability data. Unfortunately, any useful reliability 
data would have had to be based on the reinterview of 
the same Ss by different Zs and this, of course, would 
have had an effect on the observed responses. Our feel- 
ing is that Es of equivalent training would have yielded 
acceptably high reliabilities. Since the seven Es who 
collected the experimental group data had comparable 
training and the two who collected control group data 
had equivalent training, the inter-Z reliabilities within 
the two groups are probably acceptable. And this is 
really the important point, for we are especially con- 
cerned with change or lack of change within each group. 


Procedure 


The experiment was conducted at the convenience of 
Ss in their living quarters or other quiet place at a time 
when only Z and S were present and when S had one 
full hour available. When the experiment proper began, 
E was seated in front of S and recorded the frequency 
of self-reference statements and all statements at the 
end of each 2-min. period. 

In his introduction to S, E told him that he was 
doing a “survey” for the psychology department, that 
the purpose was to find out students’ “frank and honest 
personal opinions and ideas about conformity and 
education in America,” and that S could talk for about 
an hour or more on the subject. S was further told that 
E would not enter into the interview except to take 
notes. S was then reminded that the interview would 
be most fruitful if a complete, detailed and exhaustive 
expression of his ideas and opinions were obtained. 

The topic “conformity and education in America” 
was selected for several reasons. First, it was a topic 
students frequently discussed and about which they 
held quite definite opinions. Thus, it was felt that 
there would be some motivation to express views on it. 
Secondly, the subject was broad enough to permit sus- 
tained talking. Thirdly, it was a topic which neither 
imposed an upper nor a lower limit on the use of state- 
ments of self-reference. 

Except during the 10-min. reinforcement period for 
the experimental Ss, Z kept his entire attention on his 
notebook and was silent. He did not lift his head, shift 
his line of regard to S, or make any verbal responses, 
unless S stopped talking. When the latter occurred Z 
used a general, nondirective probe in order to try to 
maintain S’s output. 


Reinforcing Stimulus Events 


A single class of stimulus events was hypothesized to 
have reinforcing properties. The event consisted of 
E’s (a) lifting his head, (6) shifting his line of regard 
from his notebook to S, and (c) murmuring “mm-hmm” 
simultaneously. After a reinforcement (which took 
about 1 sec.) Z returned to his original position and 
composure until S emitted another statement contain- 
ing a self-reference. As previously indicated, it is only 
among experimental Ss in Period II that reinforcement 
of self-references was used. Among control Ss, Z was 
silent throughout the experiment and kept his attention 
on his notebook. 
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The Class of Operant Responses Conditioned 


The verbal operants selected for conditioning were 
all statements containing one or more references to S$ 
personally. Thus, statements having “I,” “me,” “my,” 
“mine,” “myself” and like words were considered self- 
reference statements, except in those instances when 
the pronoun had someone else than S as a referent 
(e.g., “nay roommate thinks...”). Statements con- 
taining the first person plural pronoun, “we,” were also 
counted as self-reference statements if they pertained 
to an activity in which S himself had participated, or a 
belief he shared. 

Statements were defined as units of “thought,” con- 
taining a complete idea. Statements, therefore, were 
complete sentences as defined in standard dictionaries, 
although in speech they might not be clearly punc- 
tuated as a written sentence. Such a definition is not 
entirely satisfactory, for the sufficient and necessary 
criteria are not enumerated, and there is, therefore, 
room for subjective variability on the part of Es. Es, 
however, becom: more proficient with experience and 
there is a tendency for them to identify more self-refer- 
ence statements as ti:eir proficiency increases. 


RESULTS 
Effects of Reinforcement 


The basic index of responding used in the 
analysis is the ratio of self-reference state- 
ments to total statements made (called SR 
ratio hereafter). The mean SR ratios for each 
period and the mean SR ratio over-all periods 
are plotted in Fig. 1 for the experimental and 
control Ss. 

The first point to be noted is that the mean 
over-all SR ratio in the experimental group is 
305, whereas it is .408 in the control group. 
The difference is significant beyond the .001 
level and represents increased proficiency by 
one £ in identifying statements and self- 
references as he gained experience during a 
period of approximately one year, as previously 
related. 

As Fig. 1 shows, the size of the SR ratio in- 
creases during reinforcement and drops during 
extinction in the experimental group, as pre- 
dicted under the hypothesis that attention- 
assent constitutes a reinforcing state of affairs. 
To determine the significance of the observed 
change in SR ratios two analyses were per- 
formed. First, Cochran’s Q test (Siegel, 1956) 
was applied to determine whether the values 
of SR deviated significantly from the mean 
over-all SR value of .305. This required a deter- 
mination of whether each individual ratio devi- 
ated above or below that value. The Q value 
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Fic. 1. SR ratio by period for experimental and 
control subjects. 


obtained was 9.38, which is significant beyond 
the .05 level. Among the controls the SR values 
during each period did not differ significantly 
from the over all mean SR value of .408 
(0 = 2.68). 

[he second analysis, Friedman’s non- 
parametric analysis of variance (Siegel, 1956), 
reveals that the treatments contributed signifi- 
cantly to the variance among experimental Ss 
(x = 16.7) but not among control Ss 


(x = 5.15). Further analysis of the data for 
the experimentals revealed that the reinforce- 


ment treatment is the major contributor. Wil- 
coxon’s signed-ranks analysis for matched 
pairs (Siegel, 1956) indicated that the SR ratio 
during reinforcement among experimental Ss 
was significantly greater (@) than the SR ratio 
during the operant level period (s = 2.5), 
(6) than the SR ratio during the first extinction 
period (s = 2.9), and (c) than the SR ratio 
during the second extinction period (g = 3.1). 

The rate of emitting self-reference state- 
ments and all statements is shown in Fig. 2. 
Among the experimental Ss the rate of self- 
reference statemenis increases significantly 
from 1.42 to 1.90 self-reference statements per 
minute during the reinforcement period 
(s = 2.98, by Wilcoxon’s test), while the rate 
of emission of all statements remains virtually 
constant. Among the control Ss both the rate 
of self-references and the rate of all statements 
drops significantly during the corresponding 
period (z = 3.20 and z = 3.13, for self-reference 
and all statements, respectively). 


Effects of Extinction 


The SR ratio among experimental Ss de- 
creases significantly following reinforcement, 


during the first extinction period ( = 2.9). In 
the second extinction period, lasting up to 20 
min., there is no further decrease in the SR 
ratio (Fig. 1). However, the rate of self- 
reference statements decreases continuously 
during extinction, as may be seen in Fig. 2. By 
Wilcoxon’s test, the rate of self-reference state- 
ments during the first extinction period is 
significantly smaller than during reinforce- 
ment (s = 3.4) and is significantly smaller 
during the second extinction period than during 
the first (¢ = 3.08). The rate of making all 
statements is not significantly smaller during 
the first extinction period than during the re- 
inforcement period ( = 1.27). However, the 
rate is significantly smaller during the second 
extinction period than during the reinforce- 
ment period, (g = 4.3). Furthermore, the rate 
during the second extinction period is smaller 
than during the first extinction period 
(s = 3.96). Therefore, although the SR ratio 
during the second extinction period is not 
smaller than during the first extinction, the 
rate of emitting self-references and total state- 
ments decreases during successive extinction 
periods. 

The rates of self-reference and all statements 
for control Ss do not vary significantly after 
the first 10-min. period of the “interview.” The 
extinction effects noted among experimental 
Ss are in no way evident among the controls. 
Hence, the observed experimental extinction 
effects are attributable to withdrawal of 
generalized reinforcement. 
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Fic. 2. Rate of emitting self-reference and all state- 
ments per minute for experimental and control subjects. 
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Many experimental Ss displayed general 
disturbance, emotionality, and, in some 
instances, downright anger and hostility 
toward E during extinction. This was especially 
noticable in the late stages of extinction. In 
her report on her “interviews,” one of the ex- 
perimenters wrote: “...many Ss found the 
interview to be a stress situation (during ex- 
tinction).” This finding is in accord with re- 
ports by Verplanck (1955). There was further 
evidence that some Ss “left the field”—.e., 
refused to continue speaking, despite prompts 

—because the discontinuance of reinforcement 
was aversive to them. It is as though non- 
reinforcement constituted punishment for them. 
However, it is not possible to say that all Ss 
whose verbal behavior stopped before the end of 
the second extinction period did so because of 
aversion. In only one S was there evidence of 
emotionality among the controls, though a 
number of them stopped responding before the 
end of the 50-min. experimental session, despite 
the use of probes, apparently having run out of 
comments on the subject of conformity. 

In Fig. 3 the number of Ss who continued to 
respond during each 2-min. interval of the ex- 
tinction period is plotted. At the beginning of 
the extinction period all experimental Ss were 
responding; during the corresponding period 
twenty-eight of the thirty-one controls were 
responding. The difference is not significant by 
Fisher’s exact test (P = .12, one-tailed). At 
the end of the extinction period, however, nine 
experimental Ss and sixteen control Ss were 
responding. The difference is significant beyond 
the .05 level (x? = 3.28, one-tailed). 


Evidence of Awareness 


Upon questioning at the end of the experi- 
ment not one experimental or control S re- 
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ported that the “interview” situation was other 
than was related to him. Some experimental Ss 
reported “awareness” of change in E’s be- 
havior during the reinforcement and extinction 
periods, this change being one of “interest” and 
“attention.” None, however, were able to 
report on what these changes were contingent 
—i.e., emitting self-reference statements. We 
may conclude, therefore, that the observed 
experimental effects were not a function of 
knowledge or awareness of the experimental 
manipulations. 


DISCUSSION 


The evidence is unequivocal that the fre- 
quency of emitting self-references can be con- 
trolled by making attention-assent contingent 
upon their occurrence. When attention and 
assent follow, self-references increase (relative 
to total output); when attention and assent 
are withdrawn, the relative frequency of self- 
references decreases. The same is true for the 
rate of emitting. self-reference statements. 
These effects are not observable among the 
controls, whose self-reference statements were 
not reinforced at any time. Further evidence of 
the effects of the experimental manipulations 
comes from the fact that all verbal behavior is 
extinguished in significantly more experimental 
than control Ss. 

Four explanations of the latter over-all ex- 
tinction phenomenon suggest themselves. First, 
it is plausible that Ss stop responding because 
they have exhausted all that they can say on 
the subject of conformity and education in 
America. Though this is undoubtedly true in 
the case of some Ss, since a number of control 
Ss stopped responding, such an explanation 
does not account for the fact that the experi- 
mental Ss are more likely than the control Ss 
to stop responding before the termination of 
the experiment. Nor does it account for the 
experimental Ss’ greater emotionality during 
extinction. Second, it is possible that state- 
ments without self-references were extinguished 
along with those containing self-references be- 
cause of generalization. Both types of state- 
ments are quite similar in that both are verbal 
responses and it is therefore possible that ex- 
tinction of self-reference statements generalizes 
to other statements. However, in the present 
instance the explanation appears improbable 
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because, if there were stimulus generalization, 
it should have been evident during reinforce- 
ment. But there was no evidence for stimulus 
generalization during reinforcement, for only 
the frequency of self-reference statements and 
rate of emitting them increased, while total 
output and rate of total output remained 
constant. 

A third possibility is that a minimum number 
of self-reference statements is necessary if 
talking on the selected topic is to be possible. 
There is no direct evidence in support of this 
explanation. However, the data in Fig. 1 in- 
directly suggest that it may have validity. 
After prolonged extinction the SR ratio among 
experimental Ss drops only to the operant 
level in those Ss who continue talking. Thus, 
approximately .270-.300 may be the minimum 
possible ratio if talking is to continue. If the 
proportion of self-references drops below this 
level (because of extinction) talking may be- 
come impossible or at least very difficult. 
Further support for this hypothesis comes from 
the fact that while the SR ratio remains con- 
stant during extinction, the ra/e of self-refer- 
ences and total output steadily decreases. 
Among the controls, on the other hand, both 
the SR ratio and the rate of self-reference 
statements and of total output remain constant 
during the corresponding periods. 

A fourth possible explanation of the extinc- 
tion of all verbal behavior is that withdrawal 
of reinforcement constitutes punishment and 
that this punishment has the effect of arresting 
all verbal behavior (and perhaps other be- 
havior) in the presence of Z. If punishment is 
defined as “an operation in which an aversive 
or conditioned aversive stimulus is made con- 
tingent upon a response” (Ferster & Skinner, 
1957, p. 731), the explanation seems reasonable, 
for silence is probably used by the verbal 
community as a conditioned aversive stimulus. 
As Skinner points out, remaining silent when 
the occasion usually gives rise to speech con- 
stitutes punishment (Skinner, 1957). Thus, in 
the present experiment the Z’s withholding of 
attention and assent, following reinforcement, 
may have had an effect very similar to the 
silence following the commission of a verbal 
faux pas in a social group. That some experi- 
mental Ss who stopped responding may have 
perceived the situation in precisely this way is 
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evidenced by their reporting that they felt the 
E had become uninterested in or even annoyed 
at what they were saying and by the fact that 
they frequently became visibly tense, angry, 
and hostile. 

Of the four possible explanations suggested, 
the data rule out that relating to the generaliza- 
tion of extinction. Running out of comments 
to make may partially account for the Ss stop- 
ping to respond altogether, for some Ss in the 
control group, as well as in the experimental 
group, stopped before 50 min. elapsed. How- 
ever, it could not account for the significantly 
greater proportions of experimental Ss who 
stopped. This disparity may be accounted for 
by either the third and fourth explanations 
offered, or, perhaps, by both. The data do not 
permit a choice between the two, however. 

With respect to interviewing of the free, un- 
structured type, the findings of this study have 
clear implications. Attention and assent—both 
common interviewer responses—can produce 
significant changes in the content of a re- 
spondent’s verbalizations. In this study, the 
effect was to increase the frequency of self- 
reference statements. When one member of a 
dyad differentially reinforces specific verbal re- 
sponse classes of the other participant, he 
gains some control over the conversation. Such 
control has obvious advantages. Psycho- 
therapists may use verbal reinforcement tech- 
niques to guide their patients toward significant 
insights; these procedures may be of consider- 
able importance, for instance, when applied to 
self-reference, affect, or other statements where 
therapeutic circumstances preclude direct re- 
quests for such information from the patient. 
Similarly, survey interviewers can control the 
content of a respondent’s verbalizations in a 
direction consonant with the purpose of an 
interview by use of differential reinforcement. 
Or, a physician might find it useful to reinforce 
differentially in his patients verbal descrip- 
tions of symptoms, as opposed to diagnostic 
inferences. 

Perhaps more importantly, interviewers 
should be aware of the possible undesirable in- 
fluence which verbal reinforcement may have 
upon the responses of interviewees. Inappropri- 
ate use of such techniques may easily lead to 
selective biases in the data or clinical evalu- 
ations based upon refiected idiosyncracies of 
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the interviewer which have been reinforced in 
a respondent or patient. Furthermore, the 
withdrawal of reinforcement, as in the present 
study, may result in emotionality that might 
have undesirable consequences, particularly in 
psychotherapeutic situations. 


SUMMARY 


An experiment was conducted to determine 
whether the use of a combination of attention 
and assent by an “interviewer” constituted a 
reinforcing event and to find out the effects of 
prolonged extinction. Following a 10-min, 
period during which the operant level of self- 
reference statements was measured. all self- 
reference statements by experimenta: Js were 
reinforced by E’s showing attention-assent for 
a period of 10 min. The Ss’ responses were 
then extinguished for a period of as long as 30 
min. The same procedure was followed with 
control Ss with the exception that they received 
no reinforcement. 

It was found that attention-assent consti- 
tuted a reinforcing state of affairs since the 
relative frequency and rate of self-reference 
statements significantly increased during con- 


ditioning among the experimental Ss, while 
control Ss showed no change. Extinction of self- 
reference statements produced a significant 
decrease in their relative frequency and rate of 
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occurrence among experimental Ss, but not 
among control Ss. 

During extinction most experimental Ss 
showed emotional disturbance; in some cases 
anger and hostility. The verbal behavior of sig- 
nificantly more experimental than control Ss 
was totally extinguished before the end of the 
30-min. extinction period. In some instances it 
was evident that Ss were leaving an aversive 
field. Possible explanations of the observed ex- 
tinction phenomena were offered. 

There was no evidence that any S was aware 
or had knowledge of the experimental manipu- 


lations. 
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a series of measures that distinguished 

between groups with homogeneous atti- 
tudes discussing a persuasive film on the topic 
of ethnic minorities (McGinnies & Altman, 
1959). The present study applied these and 
additional measures to groups differing in atti- 
tudinal composition so as to determine the 
relationships existing between group structure 
and discussion behavior. Certain of Festinger’s 
assumptions concerning forces to communicate 
in groups were also testable in the design of 
the experiment. 

The structural properties of groups, and par- 
ticularly the relative homogeneity of member 
orientations toward an issue, have been shown 
to influence perceptual and judgmental proc- 
esses of the members (Sherif, 1932; Asch, 
1952). In many of these studies, no systematic 
analysis of behavior was undertaken during 
the actual process of social interaction. We 
have attempted to follow influence pressures 
in different group structures throughout the 
process of member interactions, as well as to 
describe some relationships betweea group 
composition and interpersonal perception. 

Recent studies (Festinger, 1950; Festinger 
& Thibaut, 1951; Festinger, Back, Schacter, 
Kelley, & Thibaut, 1950) have focused on the 
process of group functioning and have analyzed 
changes in member behavior as they influence 
and are influenced by others in the group. 
In general, these studies support the hypothe- 
sis of a positive relationship between degree of 
opinion discrepancy concerning an issue and 
the relative strength of pressures to communi- 
cate with those holding deviant opinions. In 
the majority of these studies, however, the 
relative balance of divergent opinions within 


T A recent paper, the investigators reported 
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groups has not been independently controlled. 
For example, some studies use four-man 
majorities facing one-man minorities, while 
others have much larger majorities. Since 
group composition has been shown to be an 
important determinant of member behavior, 
we controlled the degree of opinion discrepancy 
in groups by systematically varying majority- 
minority conditions. 

Several additional differences between the 
present study and others dealing with intra- 
group pressures should be noted. Most studies 
in discussion settings have used situations 
where members have had to rely on others in 
the group for confirmation of their views. In 
the present study, certain members of the 
group found their views supported by the 
content of a persuasive film presented as a 
topic for discussion. In addition, earlier studies 
have defined opinion discrepancy as the degree 
of separation of members along an attitude 
continuum. Rather than manipulate the 
degree of distance between individuals in each 
group, we varied the number of individuals 
who represented extreme ends of an ethnocen- 
trism continuum. 

Briefly, the aims of the present study were 
as follows: 

1. To explore the effects of various degrees 
of minority-majority conditions on member 
behavior during the course of group 
discussions. 

2. To determine the relationships among 
attitudes, interpersonal perceptions, and forces 
to communicate within the different types of 
groups. 

METHOD 
Independent Variables 

A slightly modified version of the California E scale 
was administered to 500 male students of the junior 
and senior classes at two high schools.’ On. the basis of 


* We are grateful to the faculty and staff at Bladens- 
burg and High Point Senior High Schools, Maryland, 
for their assistance in the conduct of this research. 
John Lesser, Robert Lana, and Richard Page assisted 
in the collection of data and management of the dis- 
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their scores, Ss were randomly assigned to one of five 
types of discussion groups, each of which consisted of 
six members. A total of 25 groups were formed, five in 
each of the following structures: 
Low E homogeneous: All six members had low E 
scale scores. 
Low E majority: Four Ss were low Es; two were 


Balanced groups: There were equal numbers of high 

and low Es. 

High E majority: Four members were high Es; two 

were low Es. 

High E homogeneous: All six group members were 

high Es. 

Each group met on a single occasion 3-4 weeks after 
the initial attitude screening. The first part of the meet- 
ing was devoted to viewing a film dealing with the 
problem of minority groups. This film, “The High 
Wall,” takes an antiprejudice position in dealing with 
the conflict between teenagers of different ethnic back- 
grounds. Following the film, the group discussed the 
general problem of ethnic conflict. The discussion was 
led by a member of the research team who assumed a 
nondirective role and stimulated rather than guided 
the discussion. During the discussion, the investigators 
monitored recording equipment and coded the sequence 
and direction of participant responses. From both these 
“spottings” and recordings, w2 were able to compute a 
number of indices of discussion behavior. 

At the conclusion of each discussion, group members 
completed a questionnaire dealing with attraction to 
the group, attraction to particular group members, 
importance of the discussion topic, and perceived con- 
sensus of opinion. in addition, Ss were readministered 
the E scale. 


Dependent Variables 

In order to assess the effect of various group struc- 
tures on member behavior, three classes of dependent 
measures were used: 

1. Discussion Behavior. Several discussion behavior 
indices previously developed (Gerard, 1953; Dickens, 
1955; McGinnies & Altman, 1959) were employed. 
These measures required minimal inference by observers 
and were readily derived from the recorded discussions. 

(a) Spontaneiiy. In discussion situations character- 
ized by permissive leadership, opportunity is afforded 
group members to display varying degrees of initiative 
in their discussion behavior. The extent to which mem- 
bers act independently of leader stimulation is referred 
to as spontaneity. Operationally, a remark which 
occurred without leader probing was rated as spon- 
taneous, whereas member responses which followed 
leader questions, probes, etc. were defined as non- 
spontaneous. A percentage index of spontaneity was 
obtained by computing the number of spontaneous 
remarks relative to the total number of responses. 

(6) Rate of response. This general activity indicator 
is computed by determining the number of responses 
emitted by group members (or groups as a whole) per 
unit of time. 

(c) Spread of participation. In discussions where 
there is no requirement that members participate, it is 
probable that inequalities in response output will occur. 
To compare the homogeneity of verbal output in the 
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various structure conditions, a measure of spread of 
participation was employed (Dickens, 1955). 

(d) Direction of response. Since we recorded the 
originator and the recipient of every discussion com- 
ment, it was possible to chart the direction of communi- 
cation flow in the various groups. A modification of a 
measure developed by Gerard (1953) was used to 
represent response-direction scores. 

2. Altitude Change. The E scale was administered 
several weeks prior to the film-discussion session and 
again immediately following the discussion. Although 
our major aim was to assess variations in discussion 
behavior and interpersonal perceptions as a function 
of group composition, we used this opportunity to re- 
late discussion behavior to shifts in attitude. 

3. Questionnaire Data. In addition to the E scale, a 
questionnaire consisting of measures of attraction to 
the group, sociometric preference, relative importance 
of the discussion topic, and accuracy of interpersonal 
perception, was administered at the conclusion of each 
discussion. Some of the more interesting and significant 
findings emerged from analysis of these data. 


RESULTS 
Discussion Behavior 


(a) Spontaneity. This index, which refers to 
the degree of initiative of group members, was 
computed by dividing the number of spon- 
taneous participations by the total number of 
participations during each of three equal time 
periods. Indices were computed for each group 
as a whole as well as for high and low Es 
separately. Since the meetings lasted approxi- 
mately one-half hour, each index was based 
on about ten minutes of discussion. 

A repeated-measures analysis of variance 
indicated significant changes in spontaneity 
over the course of the meetings for all groups, 
as well as for high and low Es considered 
separately. All groups exhibited more initiative 
as the meetings progressed (Fig. 1). Although 
no reliable differences in spontaneity appeared 
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among the various structure conditions, the 
low Es were generally more independent of the 
discussion leader than the high Es. These 
results are consistent with those obtained 
earlier by McGinnies and Altman (1959). In 
addition, there was a tendency for the balanced 
groups to exhibit lower spontaneity scores 
than any of the other groups, suggesting that 
structures balanced with respect to opinion 
heterogeneity lead to a reduced level of member 
initiative. 

(b) Rate of response. A measure of discussion 
pace, or rate of output, was computed by 
dividing the number of participations in each 
of three time periods by the length of the 
time period and adjusting for the number of 
members in groups and subgroups. The index 
was computed for each total group, as well as 
separately for high and low Es. 

Analysis of response rates showed no sig- 
nificant changes over the course of a meeting 
for either high or low Es. Moreover, no sig- 
nificant variations in discussion activity 
occurred for the different group structures. 
While statistical evidence is lacking, the data 
of Figure 2 suggest that all group members 


quickened the place of their discussion behavior 
as the meetings progressed. As shown in the 
figure, the low Es had generally higher rates 


of participation than the high Es (¢ = 4.08, 
~ < .01). Thus, low Es tended to be both 
more active and less dependent on the ieader 
than the high Es. It is interesting to note, how- 
ever, that the rate of response difference 
stems largely from the heterogeneous struc- 
tures. When there is absence of opinion diver- 
gence on the discussion topic, there is 
practically no difference in participation rates 
of high and low Es. 

It should also be noted that the balanced 
groups again exhibited a unique pattern of dis- 
cussion behavior. Rates of response for these 
groups were lowest in spite of the fact that 
neither attitude subgroup was dominant. 
Festinger’s hypothesis (1950) might have 
predicted a maximum degree of both spon- 
taneity and discussion pace in such groups. 

(c) Spread of participation. To measure the 
relative equality of participations in the dis- 
cussion, an index developed by Dickens (1955) 
was used. This measure, based on the relative 
discrepancy between each individual’s response 
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output and the group mean, yields an indica- 
tion of the extent to which certain individuals 
dominated the discussions. 

While Kruskal-Wallis analyses of variance 
showed no differences in relative spread of 
participation among the various group struc- 
tures for either high or low Es, there were sig- 
nificant differences between the two attitude 
groups. Low Es tended toward equality in their 
individual contributions to discussions, while 
the high Es participation was dominated by a 
few individuals. 

(d) Direction of response. The directional 
flow of communications between members was 
computed by dividing the number of responses 
directed toward a given subgroup by the 
product of the number of possible recipients, 
the number of members in the communicator 
subgroup, and the length of a time period. 
These D scores were computed for each of the 
possible communication channels in the hetero- 
geneous structures: high E to low E; low E to 
high E; low E to low E; and high E to high E. 

These scores were analyzed by Friedman’s 
analysis of variance to ascertain temporal 
changes in communication flow and by the 
Kruskal-Wallis procedure to evaluate group 
structure and attitudinal differences in re- 
sponse direction. 

When analyzed temporally, no differences 
in direction of communication flow occurred. 
High and low Es did not direct their remarks 
differentially over time. However, there were 
significant differences in communication flow 
for each of the four channels between the vari- 
ous structure conditions. The results show, first, 
that there was a paucity of opposition directed 
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communication in the balanced groups. When 
no clearcut majority existed, members tended 
to direct their remarks largely to those partici- 
pants who held views similar to their own. When 
considered with the rate of response and spon- 
taneity data, we arrive at the picture of bal- 
anced groups being relatively lethargic or apa- 
thetic in their discussions. 

A second set of results indicate that low Es 
tended to increase communication toward 
both high and low Es as their number de- 
creased, i.e. as they moved toward the minority 
position. Conversely, high Es tended to 
decrease opposition-directed remarks as they 
progressed toward the minority position. Thus, 
there is evidence for the presence of an inter- 
action between attitude and group structure as 
determinants of influence processes. 


Altitude Change 


Since the E scale was administered to Ss 
several weeks before discussions and again 
after each meeting, it was possible to compare 
changes in attitudes across the different group 
structures. Analysis of covariance of post- 
discussion scores indicated that neither high 


nor low Es changed differentially in ethnocen- 
tric attitudes as a function of group structure. 
However, there was a significant decrease in 
ethnocentrism for pooled Ss (p < .01). 


Questionnaire Responses 


In addition to the E scale, participants com- 
pleted a short questionnaire following the 
discussions. The questionnaire covered a num- 
ber of general topics including (a) the extent 
to which members were attracted to their 
group, (6) the personal relevance of the dis- 
cussion topic, (c) the attraction of members to 
one another, and (d) the perception of opinions 
heid by other members. 

(a) Attraction to the group. Although low Es 
were least attracted to the balanced groups, 
they did not exhibit differential attraction to 
the various group structures (p > .05). On the 
other hand, high Es did show significant at- 
traction differences, with the lowest degree of 
positive feeling occurring in the balanced 
groups. Thus, both types of Ss were least 
positive in attraction to their groups in those 
structures where neither attitude subgroup 
was in the majority. One other general finding, 
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though not significant, was that low Es seemed 
to be more frequently attracted to their 
groups than the high Es. 

(b) Relevance of the discussion topic. Al- 
though there was a slight tendency for low Es 
to place greater importance on the discussion 
topic, neither predisposition group responded 
differentially to this item across the various 
group structures. 

(c) Sociometric choice. By having each S 
rank the other members with respect to 
personal attraction, we obtained another 
measure of the extent to which Ss perceived 
one another’s attitudes as these were revealed 
in the course of group discussion. 

Although analysis of variance showed no 
difference between the choice patterns of high 
and low Es, there were significant differences in 
choices among the various structure conditions 
and a significant attitude X structure inter- 
action. Examination of the data shows that the 
interaction results from the fact that the low 
Es were more apt to choose other low Es in 
both the minority and majority positions, 
while the high Es showed no appreciable differ- 
ences across structures. In other words, the 
low Es reacted sociometrically to differences in 
attitudinal composition of the groups. The 
high Es apparently were imperceptive of such 
differences, at least so far as sociometric rela- 
tionships were concerned. 

(d) Perceived consensus of opinion. We ob- 
tained a measure of the accuracy with which 
member attitudes were perceived by having 
each S identify those participants whom they 
thought held opinions similar to and divergent 
from their own. 

The results showed that low Es were gener- 
ally more accurate in identifying the attitudes 
held by other members. In addition, there were 
significant differences in the degree of ac- 
curacy for both high and low Es in the various 
structure conditions. Interestingly, the source 
of these differences was the same for both 
attitude groups. That is, Ss were most sensi- 
tive to attitudinal differences when judging 
from a minority position and were least sensi- 
tive in the balanced groups. 


DISCUSSION 


The results of the experiment may be ex- 
amined with respect to our initial aims as well 
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as to certain hypotheses suggested earlier by 
Festinger (1950). 

Festinger predicts, for example, that in- 
creased pressure to communicate to others in a 
group occurs with increases in the perceived 
opinion discrepancy among members. Our data 
enabled us to determine the extent to which the 
group members accurately perceived one 
another’s ethnocentric bias as well as the 
direction, rate, and spontaneity of their discus- 
sion behavior. Since the low Es were more 
accurate than the high Es in identifying the 
attitudes held by others in their group, it 
follows that the low Es should have been more 
impelled to communicate than the high Es, 
who were less cognizant of intragroup dis- 
crepancies in attitude. Our results support this 
prediction, since the low Es displayed higher 
rates of response than the high Es. The data 
show that this difference in response rates be- 
tween high and low Es derived mainly from 
groups in which the low Es were in a minority, 
and it was precisely these groups that the low 
Es were most sensitive to attitudinal differ- 
ences among the group members. Of course, 
the generally greater discussion activity of the 
low Es could have been due in part to the fact 
that the communication supported their point 
of view. 

An analysis of the low E response rate shows 
an increased tendency for these Ss, when in the 
minority position, to interact with both low 
Es and high Es. It might have been predicted 
that the low Es, with film support of their 
views, would have directed the preponderance 
of their comments toward the “deviant” high 
Es. This, in fact, was the case, although they 
also increased the number of remarks directed 
toward those holding similar views. Thus, as 
their number decreased, and as they became 
more aware of the opinions held by the other 
members of the group, the low Es increased 
their rate of communication to both the op- 
position and to those who agreed with them. 
Apparently, communication behavior is not 
designed solely to influence those who disagree 
with one’s own views, but also aims at pro- 
viding and receiving support from those hold- 
ing similar views. 

We obtained support for Festinger’s hy- 
pothesis that increased pressure to communi- 
cate with others occurs with increases in the 
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cohesiveness of the group. Low Es, who had the 
highest rates of response, were more attracted 
to their groups than high Es. Attraction to the 
group for the low Es, however, was not related 
to group structure. In the case of the high Es, 
the greatest attraction to the group was re- 
ported by those in the majority position; and 
it was in these groups that they coramunicated 
more with the low Es than they did with each 
other. Attraction to the group, then, to extend 
Festinger’s suggestion, is apparently an im- 
portant predictor not only of the amount, but 
of direction of communication. 

On the sociometric item, the high Es divided 
their choices about equally between low Es 
and other high Es, regardless of group struc- 
ture. The low Es, however, showed significantly 
greater preference for one another when they 
were in either a majority or a minority position. 
We suspect that this may be a reflection of the 
more active discussions held under these two 
conditions, which gave the participants a 
greater opportunity to assess one another’s 
attitudes. In fact, it was in these groups that 
the perceived consensus judgments by low Es 
were most accurate. Having assessed fairly ac- 
curately the ethnocentric dispositions of other 
members, the low Es preferred those indi- 
viduals whom they perceived as more like 
themselves. The high Es were not as percep- 
tive of differences in group structure and, 
therefore, could not be expected to assign their 
sociometric choices on other than a 50-50 basis. 

Finally, we may note the interesting be- 
havior displayed by the balanced groups, which 
were composed equally of high and low Es. 
Under these conditions, spontaneity, rate of 
response, and number of opposition-directed 
comments were generally lower than in any of 
the other structures. It may have been pre- 
dicted that maximum opinion discrepancy 
would be associated with greater discussion 
activity. Our findings also indicate that the 
members of the balanced groups were least 
attracted to their groups, were least accurate 
in judgments of each other’s attitudes, and 
were less likely to feel compatible with those 
individuals holding opinions similar to their 
own. These results suggest that where opinion 
equilibrium exists in a group, discussion ac- 
tivity is suppressed, with a resultant loss in ac- 
curacy of interpersonal perception and group 
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cohesiveness. One is tempted, in view of the 
previous finding by McGinnies and Altman 
(1959) that middle E groups were less active 
in group discussion than either low Es or high 
Es, to speculate that something analogous to 
an algebraic summation occurs when equal 
numbers of low Es and high Es form a group. 
A balancing of attitudes seems to produce 
discussion behavior similar to that in neutral 
groups. 


SUMMARY 


The purpose of this study was to investigate 
the effects of various degrees of minority- 
majority conditions on member behavior 
during the course of group discussions and to 
ascertain some of the relationships between 
various aspects of interpersonal perceptions 
and discussion behavior. 

On the basis of their scores on a modified 
version of the California E scale, high school 
students were randomly assigned to one of five 
discussion conditions representing various 
degrees of group composition. Two group 
structures had homogeneous membership with 
respect to ethnocentric attitudes, that is, all 
had either high or low scores on the California 
E scale. The remaining three group structures 
were heterogeneous with respect to E scale 
scores and contained either low Es in the 
majority, high Es in the majority, or equal 
numbers of each. 

Each six-man group viewed and discussed a 
film dealing with the general problem of 
ethnic minorities and then completed a 
questionnaire measuring various aspects of 
interpersonal perceptions. Discussion be- 
havior was measured by a series of indices 
coded during the course of group meetings. 

With one exception, the results showed no 
clear-cut and simple group structure effects, 
but indicated an interaction between group 
composition and interpersonal perceptions as 
determinants of discussion behavior. The 
major exception occurred for groups composed 
of equal numbers of high and low Es. In these 
groups, spontaneity, rate of response, and 
number of opposition-directed communica- 
tions were generally lower than in any of the 
other structures. Members of these groups also 
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were least attracted to their groups, were least 
accurate in their perceptions of opinions held 
by other members, and were less likely to 
choose sociometrically others in the group 
holding views congruent to their own. These 
findings suggest that opinion “equilibrium” 
results in a suppression of discussion activity 
and concomitant pressures to influence mem- 
bers holding divergent points of view. 

The results of the study indicated an in- 
timate linkage between interpersonal percep- 
tions and discussion behavior. Those members 
who were most accurate at identifying the 
opinions held by others in the group played a 
larger role in discussions than those who were 
not so accurate. This effect was most pro- 
nounced in the heterogeneous structures, 
where there were divergencies of opinion. In 
addition, there was a tendency for discussion 
pace and opposition-directed communication 
to be positively related to valuation of the 
group. 

The findings lend some support, as well as 
some qualifications, to Festinger’s conclusions 
regarding influence effects in small groups. 
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N 1937, Diven reported an experiment pur- 

porting to demonstrate the conditioning of 

anxiety reactions in human subjects with- 
out their awareness of the conditioned stimu- 
lus. His procedure involved a list of stimulus 
words in which the word “Barn” occurred six 
times. The subjects (Ss) were required to chain 
associate to each of these stimulus words, and 
following their chain associations to the word 
Barn Ss were given a strong electrical shock. 
Diven reported that Ss, although unable to 
verbalize the contingent relationship between 
barn and shock, gave significantly greater 


GSRs to the word Barn than did a control 
group of Ss who had not been shocked. Further, 
he reported that the conditioned GSR showed 
semantic generalization to other rural words 
that had been contained in the stimulus list. 
Haggard (1943) in a somewhat similar experi- 
ment obtained comparable results. 


For obvious reasons, these two experiments 
have played an important role in theorizing 
about psychodynamics. White (1948) among 
others has leaned extensively on the Diven 
experiment in his theory of anxiety and the 
learning of neuroses. 

There are a number of methodological flaws 
in both the Diven and Haggard experiments 
that make the conclusion of unconscious con- 
ditioning of anxiety at best very tenuous. 
Several of these defects have been pointed out 
by Lacey and Smith (1954) who repeated the 
Diven experiment with an improved design. 

They used a somewhat different word list 
and two experimental groups. Ss in the Cow- 
shock group were given shocks following their 
associations to the word Cow while Ss in the 
Paper-shock group were shocked following 
their associations to the word Paper. Heart 
rate rather than GSR was used as the condi- 
tioned response, and the anticipatory changes 
in heart rate that occurred during the S’s 
associations were analyzed instead of the re- 
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sponses following association, as was the case 
in the Diven and Haggard studies. These 
changes in method, however, appear to have 
had little effect since Lacey and Smith con- 
clude that not only did both groups of Ss 
show conditioning of anxiety reactions without 
awareness but also generalized the conditioned 
response to semantically similar words. 

In spite of their changes in design, the 
Lacey and Smith study still lacks some crucial 
controls that would make the interpretation of 
unconscious autonomic conditioning unequi- 
vocal. Eriksen (1958) has pointed out that 
their method of determining awareness in Ss 
and their method of computing conditioning 
are such as to permit a spurious precision in 
the autonomic conditioning relative to the 
verbal awareness. 

They determined awareness by intensively 
questioning Ss as to whether they had learned 
when to expect shock. Ss classified as unaware 
were those who were unable to verbaiize the 
specific contingent relationship between shock 
and the word Cow (Paper). This method of 
questioning, however, may not elicit all the 
information available to the S’s verbal report. 
It is quite possible that the Ss, if asked, could 
have given several or more words which they 
were quite confident were not shocked. It 
seems reasonable to assume that the S’s 
change of heart rate to a stimulus word which 
he is confident will not be followed by shock 
will be less than to those words of which he is 
still unsure (potentially dangerous). It would 
also seem reasonable to expect the actual shock 
word in general to be among those words that 
the S was uncertain of rather than among the 
words he was rather confident were not fol- 
lowed by shock, while a semantically dis- 
similar word would more likely be in the latter 
category. 

The evidence that previous studies have 
found on GSR generalization to words in the 
same semantic classification might actually 
represent not generalization so much as a 
beginning on the part of the S to learn to dis- 
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criminate the general area where shock occurs. 
Thus, Ss in the Cow-shock group would tend 
to classify rural words in the uncertain cate- 
gory and nonrural words including paper in 
the confident not shock category. The reverse 
would occur for the paper shock Ss. In view of 
this possibility it becomes apparent that the 
control measure used to determine whether 
conditioning is occurring becomes quite crucial. 

In the study by Lacey and Smith, condition- 
ing was computed by taking each S’s heart 
rate score to cow and subtracting from it his 
heart rate score to the word paper. It should 
be readily apparent that this method of com- 
puting conditioning is apt to lead to a spurious 
appearance of precise discrimination by the 
heart rate response which in fact may noi exist. 
To show that conditioning of heart rate is 
more specific than the stage of the S’s verbal 
awareness, an analysis is required that demon- 
strates whether or not the S shows greater 
autonomic conditioning to the actual shock 
word than he does to the rest of the words 
concerning which he verbalizes uncertainty 
as to whether they were followed by shock. 

The experiment reported below replicates 
the Lacey and Smith study in its essential 
details with the exception that S’s verbal 
awareness was explored more extensively. 
The S’s verbalizable expectancies concerning 
shock were obtained for each stimulus word in 
order to control for the possible artifact 
described above. In addition, some of the 
characteristics of the experimental situation 
that were related to the Ss’ expectancies were 
explored. 


METHOD 


The procedure was administered to each S individ 
ually. Ss were comfortably seated with the experi- 
mental apparatus screened from their view and were 
instructed that this was an experiment on the effects 
of tension on mental associations. They were further 
told that a list of stimulus words would be presented to 
them one at a time by means of a tape recorder, and 
upon the occurrence of a stimulus word, they were to 
chain associate to this word until the stop signal oc- 
curred. The process of chain association was explained 
to them, and then they were given practice in chain 
association to three nonexperimental words to insure 
that they had adequately comprehended the directions. 

Next, the shock electrodes were connected to the 
right forearm of the S, 2 in. above the wrist with an 
interelectrode distance of 1 in. The electrodes were 
fixed firmly into position by masking tape. The S was 
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told that the shock was an integral part of the experi- 
ment designed to produce tension on his part and would 
occur several times throughout the word list. At this 
point, the S was given several sample shocks in order 
to adjust the voltage level of the shock to the point 
where the S reported it as very painful. The GSR was 
recorded from the dorsal and ventrai surface of the 
middle finger of the left hand through silver electrodes. 
The finger was cleaned with alcohol and treated with 
electrode jelly. GSR records were recorded continuously 
throughout the experimental session. 

Upon completion of the instructions and after a 
short period of adaptation, the experimental session 
began. The tape recorder presented the stimulus words 
in the same order as that employed by Lacey and 
Smith. The words had been recorded by a person 
trained in speech to minimize distortion or confusability 
in pronounciations. Following the occurrence of the 
stimulus word, 15 sec. were allowed for the S’s chain 
associations before the word stop occurred. Fifteen 
seconds then elapsed between the stop signal and the 
occurrence of the next stimulus word. The 40-item 
stimulus list consisted of the words, Cow, Paper, Easy, 
and Harvest six times each, eight other rural words 
and eight nonrurai words (common adjectives and 
nouns) all distributed in temporally balanced order in 
the list. 

During the conditioning session (one presentation 
of the 40-item list), one group of Ss (Cow-shock) was 
administered a painful shock coinciding with the stop 
signal following each presentation of the word Cow. 
The other group of Ss (Paper-shock group) were 
shocked similarly following the word Paper. 

Upon completion of the conditioning phase, the 
following questions were asked and the S’s answers 
recorded verbatim: 

1. Did you know when you were going to get 
shocked? 

2. Please explain freely what you think about the 
shocks in this experiment. 

3. Please try to explain to me what you think or 
guess is the scheme behind the administration of the 
shock. Try to guess in any manner you like—you can 
guess as much as you like, all I’m interested in is what 
you think about or make out concerning shock. 

4. Let me read the list of stimulus words and let us 
see whether you can recognize any particular word or 
words which might have been followed by shock. I 
want to see whether you can distinguish between words 
which might have been followed by shock and those 
which might not have been followed by shock. You 
can say No, Yes, Uncertain or whatever you think 
concerning the probability of the particular word having 
been followed by shock. If you’re not sure, guess. 


An extinction phase followed the inquiry. It was 
identical to the conditioning session with the exception 
that no shocks were administered although the elec- 
trodes were left undisturbed and the Ss were not 
informed that there would be no further shocks. 

Apparatus. Electric shock was administered by 
means of a Harvard inductorium using a faradic 
stimulation. The primary coil was supplied from a 
3-v. de source. The voltage of the secondary was 
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adjusted for each S to a level that was reported very 
painful. 

The GSRs were continuously recorded with a 
modified Deceptograph supplied by C. H. Stoelting 
and Company. The modification was such as to permit 
the continuous reading of the S’s basal resistance level 
in ohms. 

Subjects. The 22 Ss consisted of 4 men and 18 women 
who were volunteers among graduate students in the 
College of Education at the University of Illinois. They 
were randomly assigned to the Cow-shock and Paper- 
shock groups. None was familiar with the original 
Diven experiment nor the experiments of Haggard and 
Lacey and Smith. The Cow-shock group contained 12 
Ss and the Paper-shock group 10. 


RESULTS 


On the basis of the Ss’ verbalizations during 
the inquiry, they were classified into aware 
and unaware Ss. Three Ss in the Cow-shock 
group and one in the Paper-shock group were 
able to correctly verbalize the contingent 
relation between shock and the correct stimu- 
lus word. The remaining 18 Ss were classified 
as unaware. Their verbalizations were vague 
concerning the stimulus word followed by 
shock, although some verbalized incorrectly 
but with confidence that particular words had 
been followed by shock one or more times 
during the conditioning session. The four 
aware Ss have been excluded from the follow- 
ing analyses. 

The first step in the analysis was to deter- 
mine the extent to which the present data 
duplicated the findings of Lacey and Smith. 
Complete GSR records were available on seven 
unaware Ss in each of the two shock groups.” 
The maximum GSR deflection in micromohs 
of conductance change that occurred during the 
15-sec. association interval was determined 
for each of the 40 stimulus words for each S. 
The list of stimulus words was then divided 
into four blocks of ten consecutive words each 
and the mean and SD of the GSR deflections 
for each S for each of these four blocks was 
computed. GSRs to the different words were 
then converted into z scores based upon the 
mean and SD of the 10-word block in which 
they occurred. These transformed GSR z 
scores were used in all the subsequent analysis. 

To render our data comparable with those 
of Lacey and Smith, their method of deter- 

? Due to recording difficulties, the GSR records for 


two Ss in each group were not complete. These four 
Ss were excluded from the analyses of the GSR data. 
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mining conditioning was first computed. In 
this method, the transformed GSR 2z score to 
paper (in their study, heart-rate) is subtracted 
from the response to Cow for each S. Since 
words in corresponding positions are taken 
for comparison, any temporal trend is con- 
trolled. The technique of moving averages by 
taking the mean of consecutive blocks of three 
trials is used to smooth out irregularities in 
the conditioning curves for the two shock 
groups. 

The results are graphed in Fig. 1. In terms 
of this measure of “conditioning” the Paper- 
shock group gives evidence similar to that 
obtained by Lacey and Smith with the maxi- 
mum GSR occurring on the sixth block of 
trials, following which extinction takes place. 
For the Cow-shock group, however, there 
appears to be no consistent evidence of con- 
ditioning until approximately the seventh 
trial which is in the extinction period and 
after the pause for questioning. By the tenth 
trial, however, there is some evidence of 
extinction in this group. 

Lacey and Smith evaluated the significance 
of these trends by the nonparametric one 
sample runs test applied to the response differ- 
ences block by block. Applying this test to 
our data, we find that only in the seventh block 
was a difference between the two groups sig- 
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Fic. 1. Conditioning curves computed by the method 
of Lacey and Smith (1954). 
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TABLE 1 
Mean TRANSFORMED GSR READINGS FOR THE SHOCK 
Groups ror DirrereENt GRrovupiIncs oF Worps 
DURING THE CONDITIONING SESSION 


’ 


Re- ted Nonre-| 


| 
Words 


7 vee 


won| similor 
| | = rds | 


Shock 


Shock-group |Word 


| 


—0.42| 1.81 
| —1.33 | 


Cow-shock group | 0.38 ~0.004 | 
Paper-shock group 1.82 1.37 


nificant at the .05 level (one-tailed). However, 
summing for each S$ through the six condi- 
tioning trials and testing the mea. difference 
between the Cow-shock and Paper-shock 
group resulted in a ¢ value of 1.30, which with 
12 degrees of freedom is not statistically 
significant. Summing through eight trials 
(taking the six conditioning trials and the first 
two extinction trials together) and then testing 
the mean difference for the two shock groups 
yields a ¢ of 2.83, which is significant beyond 
the .01 level (one-tailed). A similar / test 
applied to the sum of the six extinction trials 
resulted in a value of 2.31, which again with 
12 degrees of freedom is significant beyond 
the .03 level (one-tailed). 

Evidence of conditioning within each of the 
two groups during the conditioning phase was 
evaluated by a three-way analysis of variance 
(trials, Cow and Paper, Ss) of the z scores to 
Cow and Paper. A separate analysis was per- 
formed for each group. In these analyses con- 
ditioning to the shock word would be reflected 
in a significant interaction between trials 
and the Cow and Paper words. 

Only in the Paper-shock group was there 
evidence of conditioning with the interaction 
term significant at approximately the .05 level 

= 2.50 with 5 and 30 df). The main effects 
did not approach significance in either group 
and the absence of significant interaction in 
the Cow-shock group provides no evidence of 
conditioning for these Ss. 

Lacey and Smith also found that the condi- 
tioned responses tended to generalize in the 
respective groups to words of a similar seman- 
tic classification. Thus, the Paper-shock group 
showed a greater response to nonrural words 
than to rural words, whereas the Cow-shock 
group gave greater responses to the rural 
words than to the nonrural words. The present 
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data were examined for evidence of this type 
of generalization. In Table 1, the mean trans- 
formed GSR readings for the two experimental 
groups during the conditioning session are 
shown as a function of different groupings for 
the words. As is seen, the mean GSR for the 
nonrepeated similar words in both groups is 
higher than that for the actual shock words, a 
finding which is in conformity with that of 
Lacey and Smith who also noted that the 
generalized response magnitude tended to be 
higher than the response to the actual condi- 
tioned stimulus. 

The significance’ of the differences observed 
in Table 1 was evaluated by a modified three- 
way Classification analysis of variance (groups, 
word classification, and individuals). The word 
classification was broken down into shock 
words, repeated similar words, repeated non- 
similar words, nonrepeated similar words, 
and non-repeated nonsimilar words. The results 
of this analysis failed to show any significant 
effects, with the F ratio for between word 
classifications only 1.54 (4 and 48 df, p > 
05). 

Since, as will be seen below, repetition of 
words in the list has a confounding effect upon 
any measure of conditioning or awareness, a 
more meaningful test for generalization is in 
terms of the nonrepeated rural and nonre- 
peated non-rural words in the list. Again a 
modified three-way classification analysis of 
variance similar to the above was performed 
but in this case only two word classifications 
were used, nonrepeated rural and nonrepeated 
nonrural words. None of the main effects in 
this analysis was significant, but the interac- 
tion between the two shock groups and rural- 
nonrural word classification was significant 
beyond the .05 level (F = 5.87 with 1 and 12 
df). This significant interaction reflects the 
trend that can be seen in the above table 
where it is found that the Paper-shock group 
shows greater GSR to nonrural words while 
the reverse effect is true for the Cow-shock 
group. Similar analysis of variance performed 
upon the extinction trials failed to yield any 
significant effects. 

The data analyzed so far are in general agree- 
ment with the results presented by Lacey and 
Smith. For the Paper-shock group, at least, 
they could be interpreted in terms of rather 
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TABLE 2 
LEVEL OF CERTAINTY OF SHOCK SCALE AND 
ScALE ScoREs 


Verba! Responses included ia 
Fach Scale Category 


Seale | « . 
; S. 
Score | 5 cale Category 


Yes Yes 

Once or twice Maybe once or twice, Once, 
Once or twice 

Maybe, Perhaps, Possibly, 
Could be, Probably, I 
think so, I believe so 
JIncertain, Not sure 
don’t know, I don’t re- 
member, Never heard, 
Didn’t hear, Can’t see any 
connection 

| I don’t think so | I don’t think so 

| No No 


May be 


> on . 
Uncertain 
I don’t know 


specific autonomic conditioning in the absence 
of verbal awareness. However, as was previ- 
ously pointed out, this evidence for specificity 
of conditioning might very well be an artifact 
of the method of computing or analyzing con- 
ditioning. The analysis that has been con- 
ducted so far has not determined verbal 
awareness in the Ss in a manner that is com- 
mensurate with the analysis of autonomic dis- 
crimination or conditioning. While all of the 
Ss contributing to the data were unaware in 
the sense that they could not verbally report 
that shock followed Cow (Paper), they none- 
theless had various expectancies concerning 
the relations between certain words and 
shock. The existence of these expectancies 
was revealed by the more detailed assessment 
of awareness that we employed in this study. 

As will be recalled, we presented each of the 
20 different stimulus words to the S and asked 
him to state whether or not these particular 
words had been followed by shock. It was pos- 
sible to sort their replies on a seven-point scale 
defining the expectancy level of the S as to 
whether a word had been followed by shock. 
Table 2 shows how the Ss’ replies were classi- 
fied in terms of the scale scores. 

For the Ss in the Paper-shock group, the 
average number of words per S receiving a 
rating of three or higher was 5.4 (range 3-8). 
For the Ss in the Cow-shock group, the mean 
was 6.8 (range 5-12). It was further found that 
five of the seven Ss in the Paper-shock group 
had given a value of three or higher to the 
word paper, and four of the seven Ss in the 


Cow-shock group had given a value of three or 
higher to the word Cow. The words receiving 
scale values of three or higher may be con- 
sidered to be perceived by the Ss as potentially 
dangerous relative to the words having scale 
values that are lower than three. 

Our next step was to determine whether the 
GSR response was greater to the actual shock 
word than it was to the other words that S 
perceives as potentially dangerous. For each 
S, the GSRs in z score units during the condi- 
tioning trials were averaged for all words to 
which he had given a rating of three or higher, 
excluding the actual shock word. The difference 
between this mean GSR to the words per- 
ceived as dangerous and the GSR to the actual 
shock word was computed for each S, and the 
mean of the differences by experimental group 
was evaluated with a / test to see whether it 
differed significantly from zero. For the Paper- 
shock group, the mean of the z scores for the 
shock word was .18 as was compared with a 
mean of .26 for the words with scale‘values of 
three or more. The mean difference of .08 
although not significantly different from zero 
(¢ = .09) is actually in the opposite direction 
from what would be expected if the S were 
unconsciously discriminating the true shock 
word from the other words perceived as 
dangerous. For the Cow-shock group, the 
comparable means were .06 and —.16. Again 
the mean difference (.22) although now in the 
direction for unconscious conditioning, is not 
significantly different from zero (¢ = 1.54, 
p> .05). It would appear therefore, that in 
the present experiment the autenomic condi- 
tioning manifested by the Ss is no more specific 
or precise than their verbally stated awareness. 

It is possible now to show not only that the 
evidence given by the Paper-shock group for 
autonomic conditioning without awareness is 
artifactual, but also why the Cow-shock group 
did not show such evidence. Examination of the 
S’s expectancy of shock for the different words 
shows that only two of the seven Ss in the 
Paper-shock group report having been shocked 
to the word Cow while five of the seven Ss in 
the Cow-shock group report having been 
shocked to Paper. Since the GSR tends to be 
greater to words which Ss report as having 
been followed by shock, assessing conditioning 
for the Ss in the Paper-shock group by sub- 
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TABLE 3 
KENDALL’s COEFFICIENT OF CONCORDANCE, W, OBTAINED FROM DISTRIBUTION OF RANKED LEVEL OF CERTAINTY 


oF SHOCK FO 
Grouping 

Cow-shock group: Rural words only 
Cow-shock group: Nonrural words only 
Cow-shock group: All words together 
Paper-shock group: Rural words only 
Paper-shock group: Nonrural words only 
Paper-shock group: All words together 


tracting the autonomic response to paper from 
the autonomic response to Cow would give 
the general results shown in Fig. 1.’ Also, the 
fact that a high number of Cow-shock Ss 
report having been shocked to Paper would 
lead us to expect negative results from this 
type of subtractive computation. 

It is informative to analyze the expectancies 
of the Ss further in order to identify features 
and characteristics of the experimental ar- 
rangement that lead Ss to identify certain of 
the stimulus words as being associated with 
shock. First, we examined the consistency 
among Ss within experimental groups in the 
assignment of their expectancies to different 
stimulus words. For this purpose, Kendall’s 
coefficient of concordance (W) was employed. 
Table 3 shows the Ws for rural words, non- 
rural words, and all words together between 
Ss in each of the two experimental groups.‘ 

As can be seen from the table, the Ss in 
the Cow-shock group did not show a signifi- 
cant amount of consistency in their expec- 
tancies for the different words together or 
when broken down into separate categories. 
The Paper-shock group on the other hand 
showed significant consistency in all three 
analyses. Thus, only in the Paper-shock group 
is there evidence that the administration of 
shock had a systematic effect upon the 
hypotheses the Ss construct concerning shock. 

Despite lack of consistency among the Cow- 
shock Ss there is actually considerable con- 

3 This method of assessing conditioning for the two 
Ss in the Paper-shock group who reported having been 
shocked to Cow shows little or no evidence of con 
ditioning. This result is to be expected in view of the 
fact that they had also included Cow as a potentially 
dangerous word in their verbal reports. 

‘For these analyses, V in each group is nine since 
the expectancy dats were obtained for two Ss in each 
group on which complete GSR data was not available. 








No. of $s. No. of Words Kendali's Coeff. signiticance (p) 





0.127 
0.160 
0.129 
0.255 
0.468 
0.360 


9 10 
) 10 
9 20 
9 10 
9 10 
9 20 





sistency in the assignment of certainty to 
words between the two shock groups. If the 
average expectancy for each of the 20 different 
stimulus words is computed separately for the 
two experimental groups, the rank order 
correlation of these expectancies between 
groups for the 20 words is .62 (p < .05). This 
finding would suggest that there are some 
elements in the experimental arrangement 
that biased the Ss’ ideas concerning the 
occurrence of shock. 

To identify some of these factors, the stimu- 
lus words were subdivided into five categories: 
the shock word; repeated similar words; re- 
peated dissimilar words; nonrepeated similar 
words; and nonrepeated dissimilar words. A 
modified three-way classification analysis of 
variance (groups, word classification and indi- 
viduals) were performed on the expectancy 
scores for the different words. This analysis 
revealed a significant variation between word 
classification (F = 6.81 with 4 and 64 df, 
p < .01), but there were no significant differ- 
ences between the two groups nor was the 
interaction between groups by word classifi- 
cation significant. Inspection of the cell means 
from this analysis suggests that the significant 
effect due to word classification is attributable 
to the tendency for words that have been 
repeated six times in the list to have higher 
shock expectancy than words which have oc- 
curred only once, and, further, that words 
belonging to different semantic categories are 
also being discriminated with regard to ex- 
pectancy. Apparently, the Ss have begun to 
differentiate at the verbal level the experi- 
mental situation in terms of two relevant 
characteristics: the repetition of certain stimu- 
lus words and semantic classification. 
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DISCUSSION 


Lacey and Smith (1954) in their penetrating 
analyses of the Diven (1937) and the Haggard 
(1943) experiments have quite accurately 
pointed out that neither of the experiments 
contained the necessary controls to support a 
conclusion of “unconscious conditioning.” 
Diven compared the GSR of the experimental 
Ss to the shocked word with that of a control 
group who had received no shocks at all. 
As Lacey and Smith (1954) point out, this com- 
parison is invalid in the absence of a demon- 
stration that the experimental group’s reac- 
tivity to all words was not greater than the 
control group’s. This criticism also applied to 
Haggard’s study. And as was seen above, while 
Lacey and Smith use a more sophisticated 
and appropriate statistical test of conditioning, 
they still have not excluded the likelihood that 
their evidence of “unconscious conditioning 
of anxiety” is artifactual. 

The methodological flaw in the Lacey and 
Smith study is the same one that has plagued 
a number of attempts to demonstrate learning 
without awareness. Adams (1957) has termed 
it the problem of correlated hypotheses and 
has shown how failure to consider this problem 
has invalidated many of the experiments in 
this area. 

The problem arises because these experi- 
ments focus on and record only a limited 
aspect of the Ss’ behavior. When the S is 
examined for awareness, he is classified as 
unaware if he is unable to repeat the verbaliza- 
tion the EZ has formulated. But his over-all 
behavior in the experiment may be in perfect 
correspondence with his verbalizations. What 
the experimenter overlooks is that the change 
or modification in the observed response could 
actually be correlated with a number of differ- 
ent hypotheses. Convincing research in learn- 
ing without awareness must of necessity 
thoroughly explore the Ss verbalizable hy- 
potheses and expectations and only when it is 
shown that these hypotheses are unable to 
account for the observed change in the response 
is a conclusion of learning without awareness 
warranted. 

At present we are left with no convincing 
evidence that specific autonomic conditioning 
occurs in the absence of verbal awareness. 
Razran (1949) has reported salivary condi- 
tioning in human Ss without their awareness, 


but as Lacey and Smith point out, Razran’s 
description of his method of determining 
awareness is so sketchy as to leave serious 
doubts as to whether he successfully avoided 
the pitfalls that have been discussed. 

We would suggest that the future investiga- 
tions in this area would do well to abandon 
the method originated by Diven (1937). Our 
results suggest that there are so many uncon- 
trolled and confounded sources of variance in 
this design as to render virtually impossible an 
answer to the basic question of conditioning 
without awareness. Both the S’s expectancies 
of shock as well as his GSRs were a function 
of the frequency of repetition of certain words 
in the list. The introduction of new words 
toward the end of the list also seemed to 
influence the GSR, perhaps as a result of their 
novelty or unexpectedness. In addition, the 
15-sec. CS-US interval is quite long for condi 
tioning to occur, since the optimum interval! is 
generally considered to be much shorter. 

Generalization of the autonomic response 
would more appropriately be measured by the 
introduction of new words after the condi- 
tioning trials have taken place. The present 
design results in a certain amount of masking 
of what generalization effects there are due to 
the fact that the S is presented the generaliza- 
tion stimuli interspersed with the conditioning 
trials and thus is learning a condiioned dis- 
crimination. 

There is one other aspect of our data that 
deserves brief comment. This is the increase in 
GSR to the conditioned stimulus in the Cow- 
shock group during the extinction trials, 
(Fig. 1), an effect which is particularly in- 
teresting since the Ss in this group gave no 
evidence of conditioning during the six condi- 
tioning trials. It is impossible to evaluate the 
meaning of this effect since, unfortunately, we 
did not requestion the Ss following the extinc- 
tion phase of the experiment. What changes 
may have occurred in their expectancies con- 
cerning shock as a result of the inquiry period 
can only be surmised. 


SUMMARY 


This study was essentially a replication of 
the Lacey and Smith (1954) experiment on the 
unconscious conditioning of autonomic re- 
sponses. When their methods of data analysis 
were used, their results in general were con- 
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firmed. However, due to a more extensive and 
careful investigation of the Ss’ awareness in 
the present study, it was possible to show that 
the conditioning of the autonomic response 
(GSR) was no more specific than the Ss’ 
verbalizations. It was also shown that the 
method employed by Lacey and Smith to 
compute conditioning gives rise to a spurious 
conclusios. of precise autonomic conditioning 
without verbal awareness. The limitations of 
their basic design for investigation of the 
problem of unconscious conditioning were 
pointed out. 
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NVESTIGATIONS of “subliminal” ‘influences 

on behavior are not new in psychology; 

studies in this area date back to the 19th 
century. However, judging from recent reviews 
of the literature by Adams (1957), McConnell, 
Cutter, and McNeil (1958), Eriksen (1958), 
and Goldiamond (1958), a resurgence of 
interest in the area is underway. 

Most of the studies reviewed by Adams 
(1957) require a subject to make a stimulus 
discrimination. In one group of these experi- 
ments, the subject knows the nature of the 
cue but his discrimination is better than 
chance at stimulus intensities at which he 
thinks he is guessing. In another group of 
experiments the subject does not know that a 
cue to discrimination is being given. Only 9 
of the 57 studies in Adam’s review do not 
involve a stimulus discrimination task. Seven 
of these are essentially operant conditioning 
studies in which the subject does not realize 
he is being conditioned because he is unaware 
of either the reinforcer or the class of responses 
being reinforced, or of both. The use of 
subliminal suggestion in movie and television 
advertising seems to belong with this latter 
group, for the subject’s operant behavior is 
supposedly brought under control by sugges- 
tive cues of which he is not aware. The present 
study represents an attempt to construct an 
analogue of this type of situation. The question 
asked in the first experiment was: “Can one 
influence frequency of a graphomotor response 
by suggestions administered visually at sub- 
liminal intensities?” 

EXPERIMENT I 

The experimenter tried to control the 
S’s verbal productivity (the number of words 
describing pictures) with two 
suggestions: WRITE MORE and 


written in 
subliminal 


1 James A. Norton, Department of Mathematics 
and Statistics, Purdue University, was the statistical 
consultant for this study. He devised the modifications 
of standard designs and tests of significance used in the 
analyses of the data 


DON’T WRITE. A control group of Ss described 
the pictures without subliminal suggestion. 
Each of these suggestions was typed in capital 
letters on a paper-roller fitted into a tachisto- 
scope. Three TAT cards—4, 6BM, and 


7GF—were used as the pictures to be described 
in writing. These pictures, which each portray 
two persons with one turned away from the 
other, were rotated over the conditions in the 
experiment so that each picture appeared an 
equal number of times in each condition. 


Subjects 
The 36 Ss consisted of 33 student nurses and three 


secretaries. Their average age was 20.0 (SD = 1.3). 
The student nurse Ss were paid volunteers 


A pparatus 


The apparatus used was the Harvard Tachistoscope, 
a modification of the Dodge Mirror Tachistoscope. 
This model, manufactured by Ralph Gerbrands Co., 
has dual timers permitting control of two stimulus 
fields. Its fluorescent lamps may be turned on and off 
instantly. Dual mechanical timers provide exposure 
intervals of .01 to 1 sec. for two sets of laraps, each set 
controlling a field of exposure. The duration of the 
overlap or interval in the presentation of the fields is 
controlled by a third timing mechanism. Single pieces 
of Atlantic bond paper (20 wt.) were used as filters on 
the two lamps controlling the second exposure field. 


Procedure 


The Ss were told that they would be shown three 
pictures in the tachistoscope and that each picture 
would be shown 10 times, each time for a slightly longer 
interval than the preceding time. After each presenta- 
tion, they were to write on a pad everything they saw 
in the picture, including everything that was happening 
in the picture. They were told that it was permissible to 
repeat what they had written on previous presentations 
of the same picture but they were told not to use ditto 
marks or words like “same as before.” 

Before starting the experiment the word “samMPpLe”’ 
was presented for .02 sec. against a blank, lit back- 
ground field. The S was asked if she had seen either a 
word or letters presented. If (as was the case with two 
Ss) she did report a word or letters, one additional 
bond paper filter was inserted in front of each of the 
two lamps controlling the second exposure field, which 
contained the word; to insure that the experimental 
words would be subliminal. All but two Ss reported 


40-4 





SUPRALIMINAL VS. SUBLIMINAL SUGGESTION 


seeing merely a “flash,” a “blur,” or, in a few cases, 
“lines,” 

Condition I was the same for both experimental and 
control Ss. The first exposure field, containing a TAT 
picture, was exposed ten times for successive durations 
beginning at .05 sec. on the first trial and lengthening 
.05 sec. on each successive trial, reaching .5 sec. on the 
10th trial (ostensibly, the experiment was to test the 
effect of different durations of presentation on per- 
ceptual acuity). The second exposure field was blacked 
out except for an exposure slot in which the stimuli 
were to appear during the experimental conditions. 
During Condition I this slot was blank, resulting in a 
flash of .02 sec. duration overlapping with the last .02 
sec. of the presentation of the TAT picture and super- 
imposed on the picture. After each trial the S was 
allowed to write her description of the picture. She 
was instructed to say “ready” whenever she finished 
writing. No time limits were placed on the S. However, 
because of the close scheduling of .Ss it was necessary 
to discard Ss who took longer than half an hour to 
complete the ten trials of Condition I. Three Ss were 
discarded because of this limitation. As soon as the S 
signalled that she was ready for the next presentation, 
the examiner gave a ready signal and pressed the 
timer button. 

In Condition II, a second TAT card was presented. 
Ss followed the same procedure as in Condition I. The 
Ss in the control group again had a blank in the second 
exposure field. However, the Ss in the experimental 
group were given the stimulus words “WRITE MORE” in 
the second exposure field on each of the ten trials. This 
stimulus overlapped for .02 sec. with the last .02 sec. 
of the picture being presented in the first field. It 
appeared approximately in the middle of the picture. 

In Condition III a third TAT picture was presented 
with the Ss following the same procedure. The Ss in 
the control group again had a blank in the second 
exposure field. The Ss in the experimental group were 
given the stimulus words “pon’T write” in the second 
exposure field, again overlapping for .02 sec. with the 
end of the exposure period of the picture. 

After the last trial on the last condition, a blank 
card was inserted in the first exposure field which now 
served as a lit background field. The stimulus words 
from the last condition were presented at the .02 sec. 
setting and the S was asked to describe what she saw. 
The stimulus presentation was lengthened .01 sec. on 
each successive trial until the S could correctly read it. 
This threshold was obtained in order to determine how 
close to threshold the S was functioning during the last 
condition of the experiment. The mean threshold for 
the 18 Ss in the experimental group was .08 sec. (SD 
= .04). However, it was discovered in the second 
experiment to be reported that it is much more difficult 
for Ss to recognize words presented at the end of a 
primary stimulus to which they are attending, than it 
is to recognize the stimulus against a blank background. 
It is likely that the actual threshold for the words 
against the background of a complex picture was much 
higher than the threshold obtained against a blank 
background. However, on the assumption that this 
measure might have some correlation with the real 


TABLE 1 
EXPERIMENTAL DESIGN 
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Group and | N 








Conditions 





Card Order . | 


II 





Control | 
ABC 6) | 
BCA 6} 
CAB 6) 


.02-sec. 
blank 


| .02-sec. 
blank 


.02-sec. 
blank | 


Total 18 


Experi- 
mental 
ABC ) | | .02-sec. 
BCA hina | ween 
CAB (“Write 
More”’) 


02-sec. 
words 
(“Don’t 
Write’) 





Total 18 
Total Ss 36 


threshold, it was examined in relation to the magnitude 
of the suggestion effect. 

After the threshold was obtained, Ss were carefully 
questioned as to whether they saw any words being 
presented with the pictures. Only one S in these two 
groups reported seeing the stimulus words and she was 
discarded as a S in the experiment. 

Ss were urged not to tell any details of the experi- 
ment to other nurses. Actually almost all of them were 
still unaware of the nature of the experiment. Most of 
them seemed surprised when they were asked if they 
noticed words presented with the pictures. New 3s 
were questioned as to what they had heard about the 
experiment and as far as could be determined, none of 
them had heard that words were presented along with 
the pictures. 


Design 
The design is presented schematically in Table 1. 
The design was one of replicated 3 X 3 latin squares. 


The complete a .alysis of this design is an extension of 
the analysis presented by Edwards (1950, p. 326). 


Results 


The “Residual within Individuals” sum of 
squares in the design is made up of six parts: 
a, Subjects X Conditions interaction (with 
10 df) from each of six Group X Order of 
Presentation combinations. These six con- 
tributions were computed separately and 
tested for homogeneity with the Bartlett 
test, which yielded a chi square of 15.09 
(significant at the .05 level). Since the variance 
was heterogeneous a logarithmic transforma- 
tion was employed. Bartlett’s test was applied 
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TABLE 2 
ANALYSIS OF VARIANCE 


Between Groups 
Between Orders 
Groups X Orders 
Between Ss within Groups | 
X Orders 
Between Conditions 
both groups) 
Conditions X Groups 
Between Cards 
Cards X Groups 
Latin Sq. Residual 
Latin Sq. Residual X Groups 
Residual within Ss 


(avg. | 

| 101 | 12.64% 
| .027 | 3.44* 
.001 | .16 

| .013 | 1.66 

| "006 75 

| .030 3.81* 
| .008 | 


Srwrwwnrww 


— 
So 
~ 


Total 


*p= .05 
> = 00! 


TABLE 3 
Groups Means (Locs) ON CONDITIONS 


I ll In 
2.11 2.15 
2.22 2.31 





2.24 
2.29 


Control 
Experimental 


to the transformed data and the resultant 
chi square of 10.43 was not significant at the 
.05 level. 

Table 2 presents the results of the complete 
analysis of variance. For the central hypothesis 
only the Conditions X Groups interaction is 
relevant. The effects of Cards and Order of 
Presentation of the Cards are only of secondary 
interest. 

The Groups X Conditions interaction mean 
square was .02736, giving an F of 3.44 with 2 
and 60 df, significant at the .05 level. Thus, 
the subliminal stimulus conditions did result 
in a difference betweeen groups. The nature 
of this difference can be seen in Table 3 which 
gives the means of the two groups (in logarithm 
scores) for the three conditions. The control 
group showed a steady rise in productivity 
going from Conditions I to III. The experi- 
mental group showed a slightly greater rise in 
productivity from Condition I (blank) to 
Condition II (wRrITE MoRE) and showed a drop 
instead of a rise from Condition II (write 
MORE) to Condition III (DON’T wRITE). 

In order to examine more closely the effect 
of the subliminal stimulus, individual com- 


parisons were carried out on the difference in 
mean change from one condition to another in 
the two groups.* 

Three ¢ tests were performed between the 
three Conditions. Only the difference between 
groups from Condition II to Condition III 
was significant (¢ = 2.61, » = .025 for a one- 
tailed test). Apparently the subliminal sug- 
gestion “DON’T wRITE” played the greater 
role in the interaction between groups and 
conditions. On Condition ITI, 13 of the 18 Ss 
in the experimental group decreased their pro- 
ductivity (in accord with the subliminal sug- 
gestion) while only 4 of the 18 Ss in the con- 
trol group decreased their productivity. 

Condition III consisted of 10 trials. On each 
successive trial the picture stimulus was ex- 
posed for a longer interval. One might expect 
the subliminal suggestion to have some kind of 
increasing effect as trials progress. Figure 1 
contains plots of the mcan and median (log- 
arithms) numbers of words written on each 
trial for the two groups. There was considerable 
variability within groups on single trials caus- 
ing the means to be somewhat distorted by 
extreme results. In terms of mean scores, it 
can be seen that the experimental group 
started on Trial 1 at a considerably higher 
level of productivity. This difference is prob- 
ably a carry-over from Condition II. The large 
initial difference between the groups on Trial 
1 disappears by Trial 3 as a result of the rapid 
increase in the Control group and the decrease 
in the Experimental group. Beyond this trial 
the groups do not markedly diverge. The 
medians present a somewhat different picture. 
The experimental group remains more pro- 
ductive up to the seventh trial, and from the 
seventh through the tenth trial drops below 
the control group. With the exception of an 
“end-spurt” in both groups, there is a tendency 
for the experimental group to decrease while 
the control group increases over the successive 


? For instance, for the change from Condition II to 
III we have: 


y= Par — Pan) — Pon 


Vv MSs . 


an Bi 
Faw) vith 60 af 


where Y.11 is the mean for the experimental group on 
Condition II, etc. These #’s were judged for significance 
using Tukey’s Studentized Range test for multiple 
comparisons (Scheffe, 1953), the .05 level in this case 
being 2.40. 
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Fic. 1. Means and Medians (logarithms) in the Experimental Group on the Ten Trials of Condition III (“pon’r 


WRITE”). 


trials. Thus, there is some indication that the 
subliminal suggestion “DON’T WRITE” has a 
cumulative effect, when presented over a series 
of trials. 

In order to examine the relationship between 
the S’s threshold and her response to the sub- 
liminal suggestion, a rank order correlation 
was performed between the percentage drop in 
productivity on Condition III in the Experi- 
mental group and the 5S’s threshold for the 
words “DON’T WRITE” obtained against a blank 
background at the conclusion of the experi- 
ment. The 18 experimental group Ss were 
ranked from the greatest percentage increase 
to the greatest percentage decrease in pro- 
ductivity on Condition III, and thresholds 
were ranked from the highest to the lowest. 
The rho obtained between these two rankings 
was —.32. Although this value falls short of 
significance, the direction of the correlation 
suggests that high thresholds tended to be 
associated with low, or weak, response to the 
subliminal suggestion. 

The results up to this point indicated that 
one of the subliminal suggestions had a sig- 
nificant influence on the productivity of the 


Ss. The direction of the correlation between 
the threshold and the magnitude of this in- 
fluence suggested that, perhaps, the influence 
of the suggestion increases as the S becomes 
more aware of it. A third group of Ss was there- 
fore tested, this time increasing the duration of 
the suggestion stimuli so that they would be- 
come supraliminal. 


EXPERIMENT II 
Method 


Eighteen more student nurse Ss composed the third 
group of the experiment. Their mean age was 19.94 
(SD = 1.30). The procedure was the same as that of 
the previous experimental group except that this time 
the duration of the suggestion stimulus was increased 
to .5 sec., its onset beginning .02 sec. before the shut-off 
of the picture stimulus. It was assumed that this 
half-second duration would make the suggestion 
stimulus supraliminal for all Ss. The threshold for the 
Condition IIT suggestion stimulus presented against a 
blank background had not exceeded .2 sec. for any of 
the Ss in Experiment I. 


Results 


To the surprise of the experimenter, 10 of 
the 18 Ss still did not report seeing either of 
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the suggestion stimuli when they were ques- 
tioned after the experiment. The remaining 
eight Ss reported seeing one or both of the 
suggestion stimuli, during the experiment. 

The Ss who did not report seeing the sug- 
gestion stimuli were shown the succession of 
picture and suggestion stimuli with the in- 


structions: 


Now watch carefully. There are some words follow- 
ing the picture. See if you can read them 


After being “set” all except one of the 10 Ss 
were able to report the suggestion stimulus cor- 
rectly. They were questioned as to why they 
thought they had not seen the stimulus pre- 
viously. From their comments it was apparent 
that most of them were close to threshold but 
were not attending to the second stimulus. 
Four of the 10 were aware that words were 
being presented but could not read them. Their 
comments were as follows: 


A. I noticed words—saw “Don’t.”’ 

B. I couldn’t read the words. 

C. I remember I saw little squares. . . blur in it. I 
thought it was the tag on the bottom of the picture. 
If I’d been paying attention I could have seen it. 

D. It looked like it might be the title of the pic- 
ture. .. . Printing in the center of the card I was looking 
at. 

Three Ss were aware of only the flash. Their 
comments were: 


E. I noticed a flash. I guess it was because I was 
concentrating so hard on the picture. . . . I was annoyed 
at the light... .Just trying to get it out of the way 
more or less 

F. I saw a flash but it was too quick. 

G. Flash . .. but I was just expecting pictures. 


The remaining three Ss did not notice any- 
thing but the pictures. 


H. I didn’t expect to see any words. 
I. Was that there all the time? 
J. It wasn’t that slow before. 


Results for these 10 Ss were analyzed as a 
second subliminal group on the assumption 
that they were closer to threshold (or even 
partially aware of the stimulus) than the .02 
sec. subliminal group; the remaining eight Ss 
were regarded as a “supraliminal” group. Com- 
paring these two groups with the two groups 
from the previous experiment might enable 
one to determine the relationship between 
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closeness to threshold, or degree of awareness, 
and effect of suggestion. It was assumed that 
the four groups lie on a continuum of aware- 
ness in the following order: Control group, 
Subliminal .02 group, Subliminal .5 group, 
Supraliminal group. 

The variability introduced by the last two 
groups made a parametric analysis inappro- 
priate even for log converted data (Bartlett’s 
test yielded a chi square of 75.95). The sta- 
tistical method used was the Kruskal Wallis 
One-Way Analysis of Variance by Ranks 
(Siegel, 1956). The percentage increases and 
decreases of the 54 Ss in all four groups were 
ranked from the greatest decrease to the 
greatest increase. The mean ranks in each of 
the four groups on the percentage change from 
Conditions I to II and the percentage change 
from Conditions II to III can be seen in Table 
4. Large percentage drops in productivity 
lower these means while large percentage in- 
creases raise them. The median percentage 
drop or increase in productivity is also in- 
dicated in this table. 

The value of H, after correction for ties, for 
the change from Condition I to II was 2.61, 
which was not statistically significant (df = 3). 
The value of H for the change from Condition 
II to III was 13.33, which was significant below 
the .01 level. Since the mean ranks of the four 
groups were significantly different on the per- 
centage change from Conditions II to III in- 
dividual tests of significance were performed 
on differences among the mean ranks.’ The 
z values resulting from these tests can be seen 
in Table 5. Both the .02 and .5 sec. Subliminal 
groups show a significantly greater drop in 
productivity on Condition III than the Con- 


* For the difference between the mean ranks of any 
two groups, @ and }, 


N(N + 1)(\N — 1) — 2T 


¢ = 
12(N — 1) 

In this application, N = 54, and 27 was 126. (It can 
be shown that if there were only two groups, * = #). 
To test these zs for significance, the Tukey Studentized 
Range Test was again used, this time with df = ~. 
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TABLE 4 


MEDIAN PERCENTAGE CHANGES AND MEAN 
RANK CHANGES IN Four Groups 








Condition I to | Condition II to 


Condition II Condition II 
(WRITE MORE) (pon'T WRITE) 





Control 

.02 Subliminal 
.5 Subliminal 
Supraliminal 
H 





| 22.28 | +15 | 37.42 
| 28.89 | —8| 22.94 
34.80 | —12 | 16.35 
| 27.00 | +17 | 29.13 
2.61 | | 13.33 
NS | | <.01 





TABLE 5 


TEsts OF SIGNIFICANCE (z) BETWEEN MEAN 
Ranks OF Four Groups 


Group 








Control 
.02° Subliminal 
.5 Subliminal 


2.76* 3.40** 


* Significant below the .05 level. 
** Significant below the .01 level. 


trol group, but the difference between the 
Supraliminal group and the Control group is 
not significant. Although the .5 Subliminal 
group drops more in productivity than the 
.02 Subliminal group, the difference between 
these groups is not significant. 

It would thus appear that the suggestion 
effect only appeared when the suggestion stim- 
ulus was subliminal. The .5 Subliminal group 
merely furnishes a replication of the results 
in the .02 subliminal group. Why was the 
supraliminal suggestion ineffective? A closer 
look at the Supraliminal group indicates that 
the suggestion did affect at least one of the Ss 
in this group. Only three of the eight Ss de- 
creased their productivity én Condition III, 
but one of these Ss complietl in full with the 
suggestion by not writing at all during this 
condition! Her decrease was therefore 100% 
from the previous trial. A second S decreased 
56%. These two Ss showed more response to 
the suggestion than any of the other Ss in the 
experiment. However, their response is masked 
in the group data by four other Ss who showed 
large increases in productivity, or negative 
responses to suggestion. The S who fully 
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complied with the suggestion said: “I know 
it said ‘don’t write,’ so I tried to concentrate 
on the following ones, in case it said ‘write.’ 
I couldn’t see any reason for the words being 
there unless they were directed for me.” 

The S who decreased by 56% felt that the 
words were there to influence her because she 
had read about “subliminal advertising” but 
she wasn’t aware that her productivity had 
been affected by the suggestion. 

The Ss who were contrasuggestive, i.e., in- 
creasing instead of decreasing their productiv- 
ity, found some interpretation for the stimulus 
which explained their disregarding it. 

One S said: “I didn’t know whether it meant 
stop writing or not so I just went on writing 
what I saw.” 

Another S said: “I was just surprised to see 
it there. I didn’t notice it before. I was trying 
to look at the pictures and didn’t pay too much 
attention to it. I thought it meant don’t write 
at that time, not te write until the light went 
off.” 

Another S said: “My first impulse was that 
it was saying ‘Don’t write,’ but due to your 
previous instructions, I went ahead. Then I 
thought it had to do with the picture; the man 
leaving the woman and saying ‘Don’t write.’ ” 

A final question which the experimenter tried 
to answer was the question of whether the in- 
dividual differences in response to the sug- 
gestion stimulus were related to personality 
differences in Ss. A previous study by 
Zuckerman and Grosz (1958) demonstrated 
some relationship between two of the needs 
measured by the Edwards Personal Preference 
Schedule (EPPS) (Edwards, 1954) and re- 
sponse to the sway test (a measure of sug- 
gestibility). The EPPS was administered to 
eight Ss from the two subliminal groups who 
had responded in a contrasuggestive direction 
on Condition III (i.e., increasing rather than 
decreasing their productivity), and to the 
eight Ss from these groups who had shown the 
largest positive response to the suggestion 
(i.e., the largest percentage decreases in pro- 
ductivity). Comparison of these two groups on 
the 15 Murray needs measured by the test 
yielded only one significant difference. The 
contrasuggestive group was significantly higher 
on need Affiliation (p less than .01). Since one 
difference out of 15 comparisons may be due 
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to chance, it may be concluded that no per- 
sonality differences between these groups were 
revealed by the instrument used. 


DISCUSSION 


The data indicate that one of the two sub- 
liminal suggestions influenced the productivity 
of the Ss in describing the pictures. It is not 
clear why the suggestion “Don’t Write” was 
more effective than the suggestion, “Write 
More.” The suggestion Don’t Write seemed 
to have the effect of counteracting a practice 
effect, for the control group steadily increased 
their productivity going from Condition I to 
Condition III. One can speculate that there 
were two opposing motivations in the last 
condition: (a) the usual positive motivations 
of experimental Ss for personal achievement 
or to please the experimenter; (6) a fatigue 
effect toward the end of the experiment re- 
sulting in a disposition to exert less effort. 
The suggestion Don’t Write may have re- 
inforced the increasing disposition to slow 
down. More work is needed to explore the re- 
lationship between subliminal suggestion and 
the intensities and types of needs or motives 
being reinforced. 

The degree of “awareness” or recognition 
plays a crucial role in disputes over the phe- 
nomena of subliminal perception. Goldiamond 
(1958) feels that the phenomena can be ex- 
plained in terms of “partial awareness” which 
the S cannot communicate because of the 
lack of quantitative specificity in the verbal 
indicators used to define “perception.” This 
question can be treated in empirical terms by 
asking “what is the relationship between the 
subliminal suggestion effect and the ‘degree of 
awareness’ of the subliminal stimulus?” This 
question has been investigated in this experi- 
ment by: (a) examining the relationship be- 
tween the magnitude of response to suggestion 
and the threshold for the suggestion stimulus 
and (6) comparing two subliminal groups 
(differing markedly in the time of exposure of 
the subliminal stimulus), a supraliminal group 
and a control group. There is some tendency for 
a greater suggestion effect in the Ss operating 
below, but closer to threshold. However, 
neither the correlation between threshold and 
magnitude of response, nor the difference be- 
tween the .02 sec. and .5 sec. Subliminal groups 
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reached significance. Both of the subliminal 
groups did differ significantly from the con- 
trol group, whereas the supraliminal group did 
not. Ss who became completely aware of the 
suggestion seemed to respond in a dichotomous 
fashion: either conforming strongly with the 
suggestion or responding strongly in a con- 
trasuggestive direction. The latter tendency 
was actually more prevalent. This points up 
the difference between the present experiment 
and those in which the S is given a discrimina- 
tion task. When the S is looking for an answer 
to a problem, he will, of course, use the answer 
that the experimenter gives him, and the 
greater the intensity of the cue the more likely 
he will respond. However, when the subliminal 
cue is not directly relevant to the primary task 
on which the S is focusing, he is quite likely to 
reject or ignore it as an annoying distraction. 
This is what many of the supraliminal Ss 
in this experiment apparently did. The “dis- 
tracting” or “irrelevant” nature of the sub- 
liminal cue may account for the difficulty 
which the majority of Ss had in seeing it even 
though it was exposed for a relatively long 
interval following the picture to which they 
were attending. 


SUMMARY 


Two experiments were performed to study 
the effect of subliminal and supraliminal sug- 
gestion on productivity, or number of words 
written in describing a picture. An experi- 
mental group was given a task of describing 
in writing a TAT picture presented tachisto- 
scopically 10 times at increasing exposure 
levels. In the second condition a subliminal 
stimulus, the words WRITE MORE overlapped 
with the TAT picture for .02 seconds. In the 
third condition the subliminal suggestion was 
pon’r write. A different TAT card was used 
for each condition and the cards were rotated 
among conditions in three orders so that each 
card was presented in each condition an equal 
number of times. The control group performed 
the same task on the three cards but instead of 
a subliminal stimulus only a blank flash oc- 
curred in Conditions II and III. There were 
18 Ss each in the experimental group and con- 
trol groups. Significant differences between 
groups as a function of conditions were found. 
Whereas the control group increased their 
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writing rate steadily from Condition I to Con- 
dition III, the experimental group decreased 
their rate on Condition ITI. 

A second experiment was performed to see 
the effect on productivity of the same sug- 
gestions at supraliminal levels. The exposure 
intervals of the suggestions ‘were increased to 
.5 sec. Ten of the group of 18 tested under these 
conditions still could not report awareness 
of the suggestions, although some of them had 
“partial awareness.”” These Ss showed the 
same effect as Ss in the .02 sec. condition. The 
eight Ss who recognized at least one of the 
suggestions did not show a consistent response 
to them. Some responded in the suggested 
manner, i.e., markedly increasing or decreasing 
their rate on request, while others ignored the 
suggestion or did not interpret it as “instruc- 
tions.” The net effect was closer to the re- 
sponse of the control group than to that in 
either of the subliminal groups. An hypothesis 
formed from these data is that subliminal sug- 
gestion may produce some effect in the region 
just below threshold. When the suggestion 
becomes supraliminal its distracting effect 
causes a contrasuggestive response in some 
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Ss. No marked personality differences (as 
measured by the EPPS) were found between 
suggestive and contrasuggestive Ss in the two 
subliminal groups. 
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BY SCHIZOPHRENICS AND BRAIN DAMAGED PATIENTS' 


LOREN J. CHAPMAN? 
University of Chicago 


ANY Clinical writers have commented 

on an aspect of the thinking disorder 

of schizophrenic and brain damaged 
patients which they describe as a confusion of 
literal usages of words with more figurative or 
metaphoric usages. The present study is an at- 
tempt to investigate such phenomena by means 
of objective measures. 

Clinical writers have seldom defined the 
terms literal, figurative, and metaphoric, but 
have instead relied on the use of example to 
indicate their meaning. However, linguists 
have offered explicit definitions which should 
be of use to clinicians who are concerned with 
this area. 

A figure of speech is usually defined by 
linguists as an extension of the applicability of 
a word. Figures of speech are often divided 
into such categories as metaphor, metonomy, 
and synecdoche (Sturtevant, 1947, pp. 137 
139). Metaphor is vsually defined as the ex- 
tension of ihe applicability of a word in order 
to suggest a similarity or analogy between two 
objects, ideas, or events, e.g., “pull strings” for 
“use influence.” Such metaphoric extensions 
may be: (a) from one sense modality to 
another, e.g., loud color; (b) from one object to 
another within the same sense modality, e.g., 
the use of the term moon to describe man-made 
satellites; (c) from a sense modality to a notion 
that ordinarily is not susceptible to direct 
sensory apprehension, e.g., warm feelings. 
Freshly coined and original metaphors, which 
include those of poetry and other similar litera- 
ture, are readily recognized as such and usually 
require effort on the part of the listener to grasp 
their meaning. With continued usage, a 
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(M-1416) from the National Institute of Mental 
Health of the National Institutes of Health, United 
States Public Health Service. 
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metaphor becomes incorporated into the 
language to the point that it is seldom recog- 
nized as a metaphor, e.g., high pitch. 

The present investigation is concerned with 
metaphors which are fairly well incorporated 
into everyday usage but are still recognizable 
as metaphors. It is primarily concerned with 
the third kind of shift listed here, shifts from a 
sense modality to the description of subjective 
judgments, emotional states, or the quality of 
interpersonal interactions. 

Clinical writers have described the thinking 
disorder in terms of a disturbance in the use of 
this type of metaphor both in schizophrenics 
(Arieti, 1955; Bateson, 1956; Bleuler, 1950; 
Goldstein & Scheerer, 1944; Muncie, 1937; 
Storch, 1924) and organics (Goldstein, 1948; 
Head, 1926; Jakobson & Halle, 1956). All of 
these writers attribute both inappropriately 
literal and inappropriately metaphoric utter- 
ances to patients. However, most of these 
writers (except Bateson and Muncie) state that 
the apparent metaphoric utterances are seldom 
or never truly metaphoric, but are, instead, 
meant in a literal way by the patient. For ex- 
ample, Storch (1924, pp. 5-9) construed a 
schizophrenic’s reference to a person’s train of 
thought as a “thread” and his will as a “spark” 
as referring to sensory images of a physical 
spark and a physical thread without any 
metaphoric intent on the patient’s part. 

Such clinical views seem to imply that if the 
patients were presented with words in context 
and asked to indicate their meaning, they 
would often misinterpret figurative usages as if 
they were literal but would not commit the 
reverse error of misinterpreting literal usages 
as if they were figurative. That is, such a pa- 
tient might make a literal misinterpretation of 
the statement, “The only way Tom can get 
appointed to West Point is to pull strings,” as 
meaning, “It is necessary for him to grab a cord 
in each hand and yank it’”’; however, he would 
not commit the converse figurative misinterpre- 
tation of “pull strings” in the statement, ‘The 





Worp USAGE BY SCHIZOPHRENIC AND BRAIN DAMAGED PATIENTS 


only way for Jim to lace his boots is to pull 
strings.”” Nevertheless, Bateson (1956) and 
Muncie (1937) specifically state that schizo- 
phrenics commit this latter kind of error. 
Muncie believes that it is more prominent in 
schizophrenia than excessive literalness but 
that both kinds of misinterpretation occur. For 
Bateson, the tendency of schizophrenics to 
make both kinds of misinterpretation is a 
central feature of his theory of schizophrenia. 

In the present investigation, schizophrenics, 
brain damaged patients, and normal Ss were 
compared on these two kinds of misinterpreta- 
tion. 


PROCEDURE 
Instrument 


A multiple-choice paper-and-pencil test was con- 
structed. The following examples are representative of 
the items used in the test: 

9. David turned yellow when he faced the enemy. 

This means: 
A. David became cowardly. 
B. David became hungry. 
C. David’s skin became discolored. 
31. Miss Bailey’s illness turned her yellow. 
This means: 
A. Miss Bailey became cowardly. 
B. Miss Bailey became hungry. 
C. Miss Bailey’s skin became discolored. 

In Item 9 above, the correct answer is, of course, 
“David became cowardly.”” Choosing “David’s skin 
became discolored” would reflect an inappropriately 
literal interpretation of the word “yellow.” In Item 31, 
the reverse is true; “Miss Bailey’s skin became dis- 
colored” is the correct answer, and “Miss Bailey be- 
came cowardly” is inappropriately figurative. 

The third alternative, which interprets yellow to 
mean hungry, is neither correct nor a predicted error 
due to misinterpretation. But inclusion of this alterna- 
tive has a special use in the design. Errors are made for 
many reasons besides the kind of misinterpretation 
under investigation, and the bases of these errors are 
mostly unidentified. For example, many schizophrenics 
are noncooperative and respond carelessly, mark ran- 
domly, mark by position, or mark in response to other 
characteristics of the test which are difficult for the 
examiner to identify. If such unidentified errors are 
equally distributed among the three alternatives, then 
a score of predicted error markings minus “unidenti- 
fied”’ error markings should be a relatively pure measure 
of the predicted error tendency. 

Another way of putting it is that there should be, 
by chance, as many errors made for unidentified reasons 
hidden among the predicted misinterpretation markings 
as there are markings of these “unidentified” error 
alternatives themselves. Therefore, we may obtain for 
each S a relatively pure measure of literal misinterpreta- 
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tion if we subtract from the number of his literal error 
markings the number of his “unidentified” error mark- 
ings. This will be called the “corrected literal score.” 
The “corrected figurative score” is obtained by follow- 
ing the same principle. 

The instrument was a 40-item paper-and-pencil test* 
which consisted of 20 items which permit literal mis- 
interpretations and 20 items which permit figurative 
misinterpretations. The items were constructed in pairs, 
as in the example above, with one item testing for a 
literal misinterpretation of a given word or phrase and 
the other for a figurative misinierpretation of that same 
word or phrase. In each case, the “correct” interpreta- 
tion was determined by the context. The 20 figurative 
and the 20 literal items were intermixed in random 
order, with the restriction that the two items testing 
the literal and figurative aspects of the same phrase 
were separated by several other items. The order of the 
three kinds of alternatives was randomized among items 
with the restriction that the two items within each pair 
had the same order. 


Subjects 

Three groups of Ss were tested‘: 36 normal Ss, 41 
brain damaged Ss, and 51 schizophrenics. The normal 
Ss consisted of 23 medical and surgical patients at the 
West Side VA Hospital, Chicago, Illinois, and 13 
Chicago firemen. No normal S was used for whom there 
was any indication either of psychiatric or neurological 
disorder. 

The schizophrenic group, patients at Chicago State 
Hospital, had all received a hospital diagnosis of chronic 
schizophrenia, with their first hospitalization for psy- 
chosis occurring at least five years previously and with 
continuous hospitalization for at least three months 
previous to testing. Almost all of the schizophrenic 
patients had spent most of the last 10 years in the 
hospital. Other prerequisites for selection were absence 
of any indication of either cerebral organic involvement 
or mental deficiency, no current drug therapy, and no 
electric shock therapy within the past three months. 
No selection was attempted by diagnostic subtype. 

The brain damaged patier:ts consisted of 29 patients 
at Hines VA Hospital and 12 patients at Chicago State 
Hospital. The group included 13 diagnoses of cerebral 





*Copies of the test may be obtained from the 
author, or may be ordered from the American Docu- 
mentation Institute. Order Document No. 6237, from 
ADI Auxiliary Publications Project, Photoduplication 
Service, Library of Congress, Washington 25, D. C., 
remitting $1.25 for microfilm or $1.25 for photocopies. 
Make checks payable to Chief, Photoduplication, 
Service, Library of Congress. 

* The author is indebted to Kalman Gyarfus and J. 
Herbert Maltz of the Chicago State Hospital for per- 
mission to test schizophrenic patients for this experi- 
ment, to Gabrielle P. Ferrazzano for permission to test 
patients at the USPHS Hospital, to Roy Brener for 
assistance in obtaining patients at Hines Hospital, to 
Meyer Williams for assistance in obtaining Ss at the 
West Side VA Hospital, and to Robert J. Quinn for his 
help in obtaining the cooperation of Chicago firemen. 
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vascular disorder, 6 of alcoholism, 6 of multiple sclerosis 
with cerebral disorder, 3 of brain tumor, 2 of head 
injury, 1 of Huntington’s Chorea, 1 of syphilitic paresis, 
and 9 of unspecified cerebral damage. The organics, 
as a group, had been hospitalized for a shorter term 
than the schizophrenics. About one-third of the or- 
ganics had entered during the previous three months, 
and none had been hospitalized longer than five years. 

Since these patients were being tested in order to 
study the nature of cognitive disturbance when it 
results from brain injury, only those patients were 
included who appeared by clinical interview tc have 
some intellectual loss or disturbance. This decision 
was usually made on the basis of the patient’s difficulty 
in communication, serious disturbance of memory, 
or severe lack of alertness as observed either in a 
clinical interview or in his performance on the CVS, 
a brief intelligence scale (Hunt, French, Klebanoff, 
Mensh, & Williams, 1948). No patient was included 
in the brain damaged group who appeared also to be 
schizophrenic. 

The mean age was 43.4 years for the schizophrenics, 
44.7 for the brain damaged, and 43.1 for the normals, 
while mean years of schooling was 10.2 for the schizo- 
phrenics, 10.2 for the brain damaged, and 10.6 for the 
normals. Although the groups did not differ significantly 
with respect to age or education, the schizophrenic 
group was lower (p < .001) than the organics on 
functioning IQ as determined by the CVS , and the 
organic group was lower (p < .01) than the normal 
group, presumably due to the disruptive effects of the 
schizophrenic and organic pathologies. The mean IQ 
was 86 for the schizophrenics, 104 for the organics, 
and 113 for the normals. 

The Ss were assumed to be of approximately equal 
premorbid intelligence because of the equivalent 
education. An additional approximation of premorbid 
intelligence was obtained from the item number of the 
last vocabulary word answered correctly. As suggested 
by Becker (1956), this appears to be a better estimate 
than the total number of vocabulary words answered 
correctly because one effect of psychopathology is the 
production of uneveness in performance. On this 
measure, the schizophrenics scored a mean of 23.3, 
the organics 26.2, and the normals 27.7. None of the 
differences was significant. 


RESULTS 

All between-group differences were evalu- 
ated by a two-tailed Mann-Whitney test and 
all within-group differences by a two-tailed 
Wilcoxon test. 

Table 1 lists the median error scores for each 
of the three groups. Each of the three groups 
marked significantly more literal misinterpreta- 
tions than “unidentified” error alternatives 
and significantly more figurative misinterpre- 
tations than “unidentified” error alternatives 
(p < .001 in each case). 
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TABLE 1 


MeEDIAN SCORES O¥ PREDICTED AND 
UNIDENTIFIED ERRORS 








Schizo- Brain 
phrenic Normal 





Figurative error items 
Figurative errors ‘ i. 
Unidentified errors f A 
Corrected Figurative ‘ 1. 
error score (Fig- 
urative minus Un- 
identified) 


Literal error items 
Literal errors 
Unidentified errors 
Corrected Literal er- 

ror score (Literal 
minus Unidenti- 
fied) 





Comparison of schizophrenic and normal per- 
formance. By corrected literal score the 
schizophrenics were more susceptible to literal 
misinterpretations than the normal Ss (p < 
.001). However, there was only a trend towards 
a difference on the corrected figurative score 
(p < .08). It would appear, therefore, that 
the tendency toward literal misinterpreta- 
tions is greater than that toward figurative 
misinterpretations by schizophrenics on this 
test. By comparing, for the schizophrenic 
group alone, the corrected literal score with 
the corresponding corrected figurative score, 
it was found that the literal misinterpreta- 
tions exceeded the figurative (p < .01). The 
normal Ss showed a trend toward the opposite 
error preference, but not significantly so 
(p = .20). 

These findings tend to support Bateson’s 
statements, in that both kinds of misinterpre- 
tations tend to be made more often by schizo- 
phrenics than by normal Ss. The fact that the 
normal Ss also chose the misinterpretations 
more often than the unidentified errors shows 
that the misinterpretations are not unique to 
pathological patients, but rather that their 
greater frequency in schizophrenics is an 
exacerbation of a normal error tendency. The 
greater prevalence of the literal misinterpreta- 
tions is in keeping with the observations of 
Arieti (1955), Bleuler (1950), Goldstein (1944), 
and Storch (1924). This difference in frequency 
between two kinds of misinterpretation on this 
test for a single group of Ss, however, may be 
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a function of the particular literal and figura- 
tive items offered. Therefore, any finding con- 
cerning comparative susceptibilities of any one 
group to the two kinds of misinterpretations 
on this test may be lacking in generality. 
Nevertheless, any finding of a difference 
between groups on this test is not subject to 
this stricture because the same items were 
given to all Ss. 

Comparison of brain damaged patients and 
normal Ss. Unlike these schizophrenics, the 
brain damaged patients showed no tendency 
toward more literal misinterpretations, as 
measured by the corrected error score, than 
the normal Ss (p = .89). They showed only a 
trend toward a greater number of figurative 
misinterpretations than normals (p = .11). 

Comparison of the brain damaged patients’ 
preference for one or the other kind of mis- 
interpretation, as measured by the corrected 
scores, showed that they more often make 
figurative misinterpretations than literal mis- 
interpretations (p < .05). The normal Ss, it 
will be recalled, showed no significant prefer- 
ence, while the schizophrenics showed the 
opposite preference. 

Comparison of schizophrenic and organic 
brain damaged patients. The schizophrenics 
made more total errors than the brain damaged 
patients on both tests, but this may merely re- 
flect differing degrees of pathology between the 
particular samples of patients tested. Compari- 
sons of gross error tendency, which are so com- 
monly reported in the literature, do not appear 
to be meaningful on any single test unless the 
two groups are equated in some way on over-all 
pathology—an extremely difficult task. 

The schizophrenics and the organics can be 
meaningfully compared, however, on their 
relative preferences for the two kinds of mis- 
interpretations. The schizophrenics, as men- 
tioned above, made more literal misinterpreta- 
tions than figurative, while the brain damaged 
patients showed the opposite preference. This 
difference was more directly examined by com- 
puting for each S the difference between his 
two corrected error scores. On this measure, 
the schizophrenics and organics differed 
(p < .001), and they were on opposite sides of 
normal Ss. The schizophrenics differed signifi- 
cantly from thenormalson thisscore, (p < .01), 
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but the brain damaged patients did not 
(p = .26). 

Although the schizophrenics and brain 
damaged patients differed in the kind of mis- 
interpretation preferred, some of the patients 
in each group showed a preference opposite to 
that of their group as a whole. For the schizo- 
phrenics, 61 % preferred the literal misinterpre- 
tation, 25% preferred the figurative, and 14% 
showed no preference. For the brain damaged 
patients, the preferences were 29% for the 
literal misinterpretations, 54% for the figura- 
tive, and 17% with no preference. It is inter- 
esting to conjecture whether the Ss with 
atypical scores in each group may differ in 
other ways from the main body of patients. 
However, these atypical scores may have 
arisen merely from the unreliability of the test. 
For the schizophrenic patients, the Kuder- 
Richardson (Formula 20) reliabilities were .47 
for the corrected figurative error score and .55 
for the corrected literal error score. The cor- 
responding values for the brain damaged pa- 
tients were .49 and .43, respectively. 


SUMMARY 

A test was designed to measure the tendency 
to make inappropriately literal and inappropri- 
ately figurative misinterpretations of written 
statements. The schizophrenic patients made 
significantly more literal misinterpretations 
than figurative, while brain damaged patients 
showed the opposite effect. 

A score of literal misinterpretation minus 
figurative misinterpretation distinguished the 
two groups and placed them on opposite sides 
of the normal Ss. 
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LEARNING A SOCIAL STRUCTURE! 


CLINTON B. DE SOTO 
Johns Hopkins University 


OR some years social psychologists have 

shown increasing interest in mathe- 

matical properties of social structures, 
ranging from investigations of the extent to 
which sociometric choices are mutual to ex- 
perimental manipulations of the centrality of 
communication networks. This interest has 
been accompanied, however, by a curious lack 
of concern with the subjects’ (Ss’) views of 
these properties. For example, in the studies 
of mutuality of sociometric choices, the Ss 
are not ordinarily asked if they expect the 
choices to be mutual. Only recently, in such 
work as Tagiuri’s (1958) generalization of 
sociometry, relational analysis, and Heider’s 
(1958) discussion of naive psychology, has 
there been serious concern with the Ss’ per- 
ception of such properties. 

Apparently psychologists have assumed that 
Ss cannot or do not appreciate these properties. 
Yet folklore is full of essentially mathematical 
statements about interpersonal relations: ‘““The 


way to get someone to like you is to like him,” 


“You can’t serve two masters,” “There’s 
always a faster gun somewhere.” And recent 
questionnaire studies (De Soto & Kuethe, 
1958, 1959) have demonstrated that Ss ascribe 
such properties rather freely to interpersonal 
relations, sometimes accurately, sometimes 
inaccurately. 

Historically social psychologists have re- 
peatedly been compelled to recognize the 
importance of cognitive processes in man, to 
the need, for example, for crediting people 
with role expectations as well as role behaviors. 
This fact makes it appear unwise to ignore 
their readiness to assign mathematical proper- 
ties to interpersonal relations. It makes it 
important to look for effects of such attribu- 
tions of properties in activities besides answer- 
ing questionnaries. There are in the literature 
many findings which are suggestive of such 
effects. For example, Kogan and Tagiuri 

1 This work was supported in part by Grant NSF- 
G4827 from the National Science Foundation. The 


writer is grateful to Martin N. Narun and Daniel M 
Glasner for aid in the collection of data. 
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(1958) found that people expect those they 
choose to choose one another, thus evidencing, 
for a special case, their expectation that 
sociometric choices wili be mutual. 

One task in which Ss’ tendencies to assign 
properties to interpersonal relations might 
be expected to have especially prominent effects 
is that of learning a social structure. Learning 
a social structure means learning the relations 
among a set of people. It is a task that usually 
faces a person when he enters a social group, 
and it can be quite difficult as indexed by the 
information he must assimilate. It is a reason- 
able prediction that his assignments of proper- 
ties to the relations which form the structure, 
operating as expectancies or hypotheses, will 
facilitate or hinder his learning of the structure 
according to their validity. The present study 
was designed to verify this general prediction 
and alse to examine in more detail some of the 
psychological processes involved. 

The study was basically a verbal learning 
experiment in which Ss learned miniature social 
structures. The dependent variable was the 
number of trials required. There were two 
independent variables: the interpersonal rela- 
tion that formed the structure, and the mathe- 
matical oroperties of the structure. In their 
questionnaire study of the properties assigned 
to various interpersonal relations, De Soto 
and Kuethe (1959) found that the relations 
fell into two main classes: those regarded as 
asymmetric, which were also regarded as 
transitive, and those regarded as symmetric, 
some of which were also regarded as transitive 
and some of which were not. For the present 
study, the relation “influences” was chosen 
to exemplify the relations people expect to be 
asymmetric and transitive, “likes” was chosen 
to exemplify those expected to be symmetric 
and transitive, and “confides in” was chosen to 
exemplify those expected to be symmetric 
but not transitive. 

Structures characterized by eight different 
sets of mathematical properties were used. 
Four-man structures were used because they 
were the largest Ss could learn with relative 
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Symmetric, Symmetric. 


Tronsitive. 
Asymmetric, 
Tronsitive, 
Complete. 
(Compiete ordering) 


Control | Control 2 
(Nonsense structures without 


Asymmetric, Asymmetric properties.) 
Complete 
(Dominance structure) 


Asymmetric, 
Transitive. 
(Partial ordering) 


Fic. 1. Structures used in the learning experiment. 


ease in pilot studies. The structures are de- 
picted in Fig. 1, with circles representing 
people and arrows a relation. An arrow going 
from one circle to another means the first 
person stands in the given relation to the 
second; if the relation is “influences,” it 
means the first person influences the second. 
The requirements for the various properties 
may be defined informally as follows: For 
symmetry, the arrows must go in both direc- 
tions, or not at all, between a pair of individ- 
uals. For asymmetry, the arrows must never go 
in both directions between a pair of individuals. 
The requirement for transitivity is more 
complex. If there is an arrow from Individual a 
to Individual 6, and another from Individual 5 
to Individual c, then, for the transitive property 
to hold, there must be one from a to c. For the 
complete property to hold, there must be at 
least one arrow connecting the individuals in 
all possible pairs. More formal definitions of 
these and other properties are given by Coombs, 
Raiffa, and Thrall (1954) and Kershner and 
Wilcox (1950). The names given to some of 
the structures in Fig. 1 are based on their 
properties. 

The task given each S in the experiment was 
to learn one of the structures. He did not see 
a drawing of the structure, however, nor was 
the relation represented by abstract arrows. 
Instead, the arrows were replaced by one of 
the flesh-and-blood relations, “influences,”’ 


“likes,”’ or “confides in,” and he had to learn, 
by repeated trial and error, whether or not 
Jim likes Ray, whether or not Ray likes Stan, 
and so on for the 12 ordered pairs of names 
which exist in a four-man structure. This 
arrangement of the task avoided unintended 
suggestions to the S about properties of the 
structure and approximated the real-life case 
where people learn a social structure by observ- 
ing the pair-wise interactions among its mem- 
bers rather than from devices like organiza- 
tional charts. 

An experimental condition in this study was 
a combination of one of the abstract structures 
and one of the interpersonal relations. There 
were 19 experimental conditions. Each of the 
first three structures in Fig. 1 was formed from 
each of the three relations, producing 9 con- 
ditions, and the remaining five structures were 
formed from the relations “influences” and 
“likes,” producing 10 more conditions. The 
first 9 conditions were the basic experimental 
ones, since a crucial interaction was predicted 
for them. On the basis of the conclusion by 
De Soto and Kuethe (1959)'that college students 
(used as Ss in that study and this) are alert 
to power based orderings, it was predicted 
that they would expect the relation “in- 
fluences” to be asymmetric and transitive and, 
possibly, complete and would therefore learn 
a complete ordering based on “influences” 
more readily than one of the symmetric 
structures, while for “likes’’ and “confides in,”’ 
which Ss expect to be symmetric, the converse 
would be true. The remaining conditions were 
included to provide further analytical pos- 
sibilities. 


METHOD 


The experimental method resembled that of tra- 
ditional paired-associates learning experiments. The 
Ss, male undergraduates at Johns Hopkins University, 
were assigned randomly, nine to each of the 19 ex- 
perimental conditions, and were run individually. 

Their task is described well by the instructions given 
them: 

This experiment is on the learning of interpersonal 
relationships. There are four people involved, Bill, 
Jim, Ray, and Stan, and your job is to learn who 
likes who among these people. 

On the upper side of each of these cards are 
printed the names of two of the four people. In each 
case, the first person either does or does not like the 
second. If he does, it says so on the other side of the 
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card. If he doesn’t, nothing is printed on the other 

side of the card. 

When I tell you to begin, look at the names on the 
top card. If you think the first person likes the second, 
say so. If you don’t think he does, say so. For ex- 
ample, if the first name were Sam and the second 
were Charlie, you would say either “Sam likes 
Charlie” or “Sam doesn’t like Charlie.” Then, turn 
the card over and see whether or not he does, and 
place the card on a new pile. 

Go through the rest of the cards the same way, 
and then I will shuffle them and you can go through 
them again. As we go through the cards again and 
again, you should learn better and better who likes 
who. 

Take your time. Don’t rush. 

The above are the instructions given to all Ss who 
learned a social structure based on “likes.” The instruc- 
tions given to Ss who learned a social structure based 
on “influences” or “confides in” were identical in form 
except that the appropriate one of the following 
definitions was added: “To influence someone means 
to have some degree of control over what he does”; 
“To confide in someone means to have complete faith 
and confidence in him so that you trust him with 
secrets or rely on him for help.” 

Each S worked with a deck of 12 cards appropriate 
for his experimental! condition. 


RESULTS 


The difficulty of learning the various social 
structures was measured by the mean number 
of trials required by the Ss to learn them to a 
criterion of two successive correct trials. These 
means are shown in Table 1. The standard 
errors of the means, highly correlated with 
their values, range from 1.06 to 4.27. 

The hypothesized interaction between rela- 
tions and structures is evident in Table 1. 
The Ss learned a complete ordering based on 
“influences” very readily, but had somewhat 
more difficulty with symmetric structures 


TABLE 1 
MEAN TRIALS TO CRITERION 


Relation 


Likes 


Struct re Con- 


Influences fides in 


14.7 16.2 


Asymmetric, transitive, com- 8.9 
plete (complete ordering) 

Symmetric, transitive 

Symmetric 

Asymmetric, transitive (par- 
tial ordering) 

Asymmetric, complete (dom- 
inance structure) 

Asymmetric 

Control 1 

Control 2 


12.9 
11.7 
12.1 


10.9 9.1 
13.0 10.9 
14.7 

26.4 23.7 
16.3 1 
16.1 6 
16.7 0 
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based on “influences.” They had difficulty 
learning a complete ordering based on “likes” 
or “confides in,” but learned symmetric struc- 
tures based on these relations easily. An 
analysis of variance performed on the data 
for these nine conditions yielded an interaction 
mean square significant at the .01 level. 
Examination of the remaining ten means 
shows that the partial ordering was learned 
relatively easily, at least when based on 
“influences,” but the other conditions were 
all difficult, some of them exceedingly so. 
Particularly striking is the finding that the 
asymmetric and complete structure based on 
“influences” took substantially longer to learn 
than any of the nonsense structures, rather 
than showing facilitation from the fact that 
its properties were partly as expected by the Ss. 


DISCUSSION 


The experimental results clearly support 
the general hypothesis that people’s assign- 
ments of mathematical properties to an inter- 
personal relation influence the ease with which 
they learn a social structure based on the 
relation, and they invite analysis of the psycho- 
logical processes involved. 

Perhaps the most remarkable finding is 
this: although the combination of the asym- 
metric, transitive, and complete properties 
(complete ordering) greatly facilitates the 
learning of a social structure based on “in- 
fluences,” the combination of only the asym- 
metric and complete properties (dominance 
structure) seems to make the learning task 
even more difficult than it is for structures 
without properties. The Ss failed completely 
to profit from the latter combination of 
properties, although in principle it too could 
be a valuable aid to learning a structure. 
Why? 

It appears an inescapable conclusion that 
for the Ss these properties were not indepen- 
dent of one another, even though they are 
independent mathematically, and even though 
Ss are able to ascribe them separately to 
relations when questioned (De Soto & Kuethe, 
1958, 1959). De Soto and Kuethe noted a 
tendency for Ss usually to ascribe the transi- 
tive property whenever they ascribed the 
asymmetric property to a relation. Evidently 
this tendency to couple the properties is so 
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powerful that it prevents people from using 
the properties in isolation when learning a 
social structure. People expect mathematical 
properties in social structures, to be sure, but 
they expect them in bundles, and incomplete 
bundles are of no use to them. 

It is as if the Ss had a theory about the 
social structure, a mathematical model for it 
which was very helpful when its properties 
corresponded with those of the social structure, 
but which had to be applied in an all-or- 
nothing manner in the sense that it was not 
readily abridged in the face of some lack of 
correspondence and became instead a hin- 
drance. Such a theory, marked by certain 
essential properties but doubtlessly skeletal 
and sketchy in other respects, seems most 
aptly called a schema, roughly following Bart- 
let (1932) and Woodworth (1938). 

The present data, then, might be interpreted 
to indicate that the Ss possessed an ordering 
schema which they automatically applied to 
a social structure based on the relation “in- 
fluences,” although not to one based on “likes”’ 
or “confides in.” “Influences” might be said 
to tap or arouse people’s ordering schema. 
This ordering schema, the residue perhaps of 
countless experiences with social and nonsocial 
orderings in the world, seems to be marked 
by the properties of asymmetry, transitivity, 
and, possibly, completeness. Thus the Ss 
expected these properties in a social structure 
based on “influences,” but not as independent 
expectations; rather, as derivatives of the 
overriding expectation that the structure was 
an ordering. Perhaps the dominance structure 
was more difficult than the nonsense structures 
because it perpetually titillated and tantalized 
the Ss’ ordering schema, preventing the sup- 
pression of it that occurred for the nonsense 
structures. 

Evidence that more sophisticated students 
of social structures also possess and can be 
victims of an ordering schema is found in 
many writings about peck rights among birds. 
Birds establish peck rights on the basis of 
pair-wise encounters, a process which insures 
that the eventual social structure is asym- 
metric and complete, but not that it is transi- 
tive. The birds, being simple-minded creatures 
not possessed of an ordering schema, thus 
innocently build a social structure that puzzles 
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and provokes more thoughtful creatures who 
write about the “pecking out of order,” 
“curious exceptions,” “irregularities,” “im- 
perfect orderings,” “lack of a hierarchy,” 
“polygonal dominance,” “cyclic pecking,” 
“triangular pecking,” and “‘complex, triangular 
hierarchies” (Allee, 1931; Collias, 1944; Mur- 
chison, 1935; Murphy, Murphy, & Newcomb, 
1937; Schjelderup-Ebbe, 1935). People con- 
sider this primitive and simple social structure 
surprising and complex because they expect an 
ordering. To turn the thing around, it may be 
that the expectation of an ordering tends to 
produce an ordering, that human social groups 
would show fewer orderings and more domi- 
nance structures and even property free struc- 
tures if it were not for the shared expectation 
of orderings. 

The finding that the partial ordering was 
learned quite readily when based on “in- 
fluences,” with the Ss’ ordering schema pre- 
sumably aroused, suggests that the property 
of completeness is a less integral part of the 
ordering schema than the properties of asym- 
metry and transitivity. However, any partial 
ordering can be mathematically treated as 
made up of two or more complete orderings 
(Coombs & Kao, 1955; Kershner & Wilcox, 
1950), and it is possible that the Ss accom- 
plished essentially this analysis of the partial 
ordering. 

The property of symmetry seems to be 
fairly easy to detect and utilize even when it 
is not expected, as in the structures based on 
“influences.” The results suggest that for 
“likes” and “confides in” a symmetric and 
transitive structure is learned easier than a 
merely symmetric one, but the differences are 
not large, and it should be noted that the 
so-called symmetric and transitive structure 
of Fig. 1 is only vacuously transitive (Kersh- 
ner & Wilcox, 1950), having too few members 
to be truly transitive. Thus, it is impossible 
to say more than that a lack of transitivity is 
not the severe handicap in learning symmetric 
structures that it is in learning asymmetric 
ones. This conclusion is consistent with the 
view expressed earlier (De Soto & Kuethe, 
1959) that the primary property ascribed to 
these relations is symmetry, and that tran- 
sitivity is ascribed secondarily, only after 
long experience which teaches that relations 
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like “likes” are commonly transitive, dividing 
people into cliques or friendship groups. The 
grouping schema, as applied to interpersonal 
relations at least, is a higher development of 
the symmetry schema. 


SUMMARY 


By means of a learning experiment it was 
shown that a social structure is easiest to 
learn when it possesses the mathematical 
properties people expect in it. People attribute 
the properties of asymmetry, transitivity, 
and completeness (the properties of an order- 
ing) to the relation “influences.” If a social 
structure formed of the relation “influences” 
has these properties, they learn it readily. If 
it is symmetric, or if it has no mathematical 
properties, they have more difficulty; and if 
it is asymmetric and complete but not transi- 
tive (a dominance structure), they have great 
difficulty. Similarly, structures based on “likes” 
or “confides in” are learned most readily when 
they are symmetric, as people expect them 
to be. The findings were interpreted as reflect- 
ing the operation of schemas in the Ss. 
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tion and Aggression, Dollard, Dool), Mil- 

ler, Mowrer, and Sears (1939) hypothe- 
sized that the total strength of the “instigation 
to aggression” within an individual is a posi- 
tive function of three aspects of the frustra- 
tions he had suffered: “1) the strength of the 
instigation to the frustrated response, 2) the 
degree of interference with the frustrated re- 
sponse, and 3) the number of frustrated re- 
sponse-sequences” (p. 28). Even accepting 
Miller’s later admonition that these proposi- 
tions refer to the instigation to aggression (for 
simplicity, we might say “aggressive drive’’) 
and not aggressive acts (1941), many critics 
still objected to their failure to differentiate 
among qualitatively different classes of frus- 
trations. For example, Maslow (1941) main- 
tained only threatening frustrations give rise 
to aggression, while mere deprivations pre- 
sumably do not. In the same vein, Pastore 
(1952) proposed that arbitrary frustrations 
are more likely to increase aggressive drive 
than are nonarbitrary frustrations. One reason 
for this, he suggests, is that the arbitrary 
frustrations involve the nonfulfillment of an 
expectancy, and aggressive reactions are said 
to result primarily when the expected attain- 
ment of a goal is blocked. Bateson presented an 
essentially similar hypothesis in his discussion 
of cultural factors affecting the frustration- 
aggression relationship (1941). The Balinese 
apparently do not learn to expect strong satis- 
factions, supposedly because of the frequent 
frustrations imposed upon them by their 
mothers in the course of their socialization. 
Therefore, Bateson argued, “they are in- 
finitely willing to suffer interruption” (p. 353). 
Dollaré and his colleagues generally did not 
consider the operation of expectancies in af- 
fecting the reactions to frustrations, and this 
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lack is particularly felt in their analysis of 
frustrated response-sequences. There is no 
unequivocal support for their principle that 
the strength of the aggressive drive increases 
with the number of specific frustrations. The 
only evidence they give for the hypothesis is 
anecdotal, and experimental tests of the 
proposition have yielded inconsistent results. 
Otis and McCandless (1955) conducted an 
investigation with preschool children placed 
in an eight-trial frustration task. The Ss 
showed significant increases in “aggressive- 
dominant” behavior scores from the first four 
to the last four frustration trials, and showed 
reliable decreases in a class of behavior found 
to be incompatible with aggression, “submis- 
sive-complaisant” behavior. In contrast to 
this support for the proposition, McClelland 
and Apicella (1945) obtained negative findings 
in an experiment employing college students 
on only two frustration trials. 

These opposing results can be reconciled 
by raising the possibility of differences in the 
development of understanding or expectations 
following the first frustrations. The person 
who anticipates further frustrations after the 
first disappointment may not be as upset when 
they occur as one who does not expect them. 
Conceivably, then, to the extent that the 
Otis-McCandless results are correctly inter- 
preted as showing an increase in overt hostility 
with repeated frustrations, their Ss may not 
have expected these continued blockings of 
their goal attempts. The McClelland-Apicella 
Ss, being older and more sophisticated, pre- 
sumably did develop these anticipations. 

Findings consistent with this reasoning were 
reported in an earlier paper by the present 
writer (Berkowitz, 1960a). The Ss received 
two acts from their partners. The first of these 
was either friendly or unfriendly in tone and 
the second (in a factorial design), although of 
a qualitatively different nature, also was 
either friendly or hostile. The data showed 
that the partner’s second act had its greatest 
effect on S’s attitude toward him when it was 
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inconsistent with the earlier behavior. That 
is, if the second act was relatively aggressive 
in manner, it provoked the greatest increase 
in unfriendliness toward the partner (P) when 
it had been preceded by the friendly behavior. 
(However, this was but one case of a more 
general principle: the greatest change toward 
friendliness following the second friendly act 
occurred when this behavior came after hos- 
tility from P.) [t may be that the Ss were 
relatively indifferent to the aggressive (or 
friendly) second act when this behavior was 
consistent with the earlier behavior because 
such actions were more or less expected of P. 

The first experiment in the present report 
employed a procedure virtually identical to 
that used in the earlier investigations, and 
essentially the same results should be obtained. 
Assuming that the initial act from P gave 
rise to an hypothesis about him, there should 
be a greater resulting change in attitude toward 
P following his second communication if this 
behavior was greatly inconsistent with the 
hypothesis than if the act was in accord with 
S’s conception of P. 

This prediction appears to differ from the 
results obtained in a number of studies in the 
area of “person perception.” Typically, in- 
vestigators have found a “primacy effect” in 
which the first-; resented information concern- 
ing a person presumably gave rise to a general 
hypothesis about him, with later information 
then being assimilated to this hypothesis 
(Asch, 1952; Kelley, 1950; Luchins, 1957). 
For example, Kelley led one group of Ss to 
believe that a speaker was a “warm’’ person, 
while other Ss were induced to develop the 
hypothesis that he was “cold.” Later, when 
the speaker delivered his talk, both groups of 
Ss evaluated the speech in a manner consistent 
with their particular hypotheses about him. 
The present prediction, instead of assuming a 
similar assimilation tendency, if anything, 
argues for a “contrast (or recency) effect.” It 
essentially maintains that P’s second action 
toward S is contrasted with the initial expec- 
tation, and as a consequence, influences S’s 
final attitude toward P to a greater extent 
than P’s first behavior. 

The major difference between the present 
studies and the Asch-Kelley-Luchins investi- 
gations involves the discrepancy between S’s 
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initial hypothesis and the information subse- 
quently provided about P. It seems fair to 
assume that this later information in the 
Asch-Kelley-Luchins studies was not regarded 
as very different from S’s first concept of P, 
either because the information was somewhat 
ambiguous in nature or because it truly was 
not greatly inconsistent with the hypothesis. 
In summarizing Luchins’ findings, Hovland 
(1958) stressed the fact that the Ss did not 
consider the second set of information about 
the stimulus person to be incompatible with 
the first presented material, and he suggested 
that this is a limiting condition for the Luchins 
results. The suggestion is entirely in keeping 
with the present argument. There is reason to 
believe that if the Ss had perceived the in- 
compatibility between the two sets of material, 
a contrast would have occurred in which the 
later material would have been judged as 
being even more different from the first pre- 
sented information than was actually the case 
(for example, as defined by independent 
judges (Berkowitz, 1960b]). The probable re- 
sult of this contrast is a recency effect; the 
later information has a greater impact in af- 
fecting the final attitude toward P than does 
the first information. This essentially was the 
finding in the investigator’s previous experi- 
ment (1960a), and presumably it arose from 
just this process: a contrast effect, in which 
P’s second act is evaluated as being extremely 
different from his first communication (and/or 
different from the initial hypothesis formulated 
concerning P), giving rise to the dominant 
influence of this later behavior upon S’s atti- 
tude toward P. 


EXPERIMENT I 


Method 


Procedure. With some exceptions probably net rele- 
vant to our present purposes, the procedure of Experi- 
ment I was identical to that employed in the earlier 
investigation (Berkowitz, 1960a). Summarizing this 
procedure, two Ss of the same sex were brought to- 
gether ostensibly for a study of the formation of “first 
impressions.” After receiving the instructions and a 
brief description of their task, they went into separate 
cubicles where they indicated their impressions of their 
partners on the first questionnaire (Q1). Tliree inter- 
correlated items in this questionnaire index S’s baseline 
level of friendliness toward P. However, the question- 
naire purported to assess S’s judgments of P’s person- 
ality. 
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The Ss then wrote two notes to each other at 5-min. 
intervals in which they described themselves. They 
were to use their partner’s self-descriptions in evaluat- 
ing him on the second questionnaire. EZ substituted 
previously constructed notes for the notes actually 
written in order to produce the initial hypothesis con- 
cerning P. The notes received by the Ss in the Initial 
Hostility condition indicated that the partner had little 
regard for the University and most of its students, 
while Ss in the No Initial Hostility condition were led 
to believe that the partner liked life at the University, 
and was fairly active in student affairs. We assume that 
the former Ss expected P to be an unpleasant or hostile 
person as a result of these notes, while the latter Ss 
expected him to be friendly. Following this, Ss again 
indicated their impressions of their partners on a second 
form of the questionnaire (Q2). The same items used in 
Q1 were employed in testing the success of the initial 
hypothesis-creating manipulation. 

E then collected Q2 and told the Ss that they would 
now be given still more information about their part- 
ners: they would be told what their partners thought 
of them. EZ gave each S a previously completed Q2, 
saying that this was the partner’s second rating of the 
S. Half of the Ss in each of the above conditions (Un- 
favorable Evaluation) received an evaluation with 
uniformly unfavorable ratings, with the others (Favor- 
able Evaluation) getting highly favorable ratings. After 
the Ss had completed the third questionnaire (Q3) and 
an adjective check-list, the deceptions and experi- 
mental procedures were explained and the Ss were dis- 
missed with a pledge of secrecy from them. 

Statistical analyses and dependent variables. The Ss in 
each condition were differentiated according to whether 
their Q1-Q2 changes indicated either a decrease or an 
increase in unfriendliness toward P following his initial 
communications. 5S’s initial hypothesis concerning P 
may be inferred from these changes. A decrease in un- 
friendliness is taken as denoting an Expectation of 
Friendliness, while an increase in this unfavorable atti- 
tude defines an Expectation of Hostility. Within each 
of these groups, the Ss were further subdivided in 
terms of whether their inferred hypotheses were strong 
or weak. They are said to have a Strong Expectation if 
their attitude changes after P’s initial communications 
coincided in direction with the changes displayed by 
the majority of Ss (and therefore, presumably were in 
accord with the objective nature of the notes). The Ss 
exhibiting Q1-Q2 attitude changes differing in direction 
from the majority of Ss’ responses to the particular set 
of notes are defined as having a Weak Expectation.* 

The unfriendliness score involved in these attitude 
changes is based upon the responses to three inter- 
correlated items on the 13-item questionnaire forms. 
2 This reasoning assumes that all Ss recognized, to 
some extent at least, the “objective” nature of P’s 
first communication. (Evidence consistent with this 
assumption is provided below.) Therefore, those whose 
feelings toward P changed in a direction inconsistent 
with this communication are regarded as being less 
certain of their initial hypothesis about P; they pre- 
sumably had detected some evidence contradicting 
their initial belief. 
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The items ask, “Would you admit him (her) into your 
circle of close friends?” ; “Would you like this individual 
as a roommate?”’; and “Would you admit him (her) 
into the social organization of greatest importance to 
yourself?” In each case the answers consist of a check- 
mark placed anywhere on a line ranging from Definitely 
yes to Definitely no, with the score being the distance 
away from the Friendly end of the scale in one-quarter 
inch units summed over the three items. 

In addition to these questionnaire responses, the Ss 
at the end of the experimental session were asked to 
describe P on an adjective check-list. Sixty-seven of the 
adjectives on this list had been shown in previous re- 
search to be particularly sensitive to changes of feeling 
on the evaluation dimension. Using the classifications 
of 63 undergraduate judges as a basis for dividing the 
terms into Good or Bad traits, S’s final level of un- 
friendliness toward P was assessed by adding the num- 
ber of bad traits P was said to have to the number of 
good characteristics he was said not to possess. These 
scores were then “normalized” by means of a square- 
root transformation. For both the rating scale and 
check-list measures, our working assumption is that 
negative evaluations (or unfriendliness responses) are 
indicative of hostility. 

Subjects. There are 160 Ss (half of them women) in 
this first experiment, recruited from introductory 
psychology courses at the University of Wisconsin. 
These Ss were distributed evenly, maintaining a one- 
to-one sex ratio, among the four experimentally-created 
conditions involving the combination of notes and 
evaluations from P. A preliminary analysis of variance 


of the data employing these classifications did not reveal 
any significant sex differences, and this variable is 
therefore ignored in the present report. 


Results 


Effectiveness of the initial hostility manipula- 
tion. There were no significant differences 
among the eight conditions in the mean level 
of unfriendliness toward P on the first ques- 
tionnaire (F = 1.38 with 7 df). The frequencies 
of Ss in each of the conditions, shown in 
Table 1, demonstrate that the initial notes 
generally were successful in establishing the 
intended attitude toward P; by far the greatest 
proportion of Ss changed their attitudes from 
Q1 to Q2 in a direction consistent with these 
communications. (When the various evaluation 
conditions are grouped, since the Ss had not 
received these evaluations at this stage, 
x° = 21.04; p < .001). 

The same conclusion can be drawn upon 
inspection of the condition differences in 
mean changes at this time. If only the Ss who 
had increased in unfriendliness toward P are 
considered, there was a significantly greater 
increase in the Ss receiving the hostile notes 
than in the Ss getting the friendlier communi- 





REPEATED FRUSTRATIONS AND EXPECTATIONS IN HostTiLtiIry AROUSAL 


TABLE 1 


MEAN CHANGES IN QUESTIONNAIRE UNFRIENDLINESS TOWARD P 











I Changes from Q1 to Q2 
Ss who: 


(a) Increased in  unfriendiiness 


from Q1 to Q2 


(b) Decreased in unfriendliness 


from Q1 to Q2 


II Changes from Q2 to Q3 


Inferred initial hypothesis: 


(a) 


Inferred initial hypothesis: 


(b) 


Initial Hostility 


No Initial Hostility 








Unfavorable Favorable 
Evaluation Evaluation 
(A) (B) 


Unfavorable Favorable 
Evaluation Evaluation 
(C) (D) 








(32) (24) 
3.98 3.69 

(8) (16) 
—1.68 —1.91 


(32) (24) 
Strong-Hostility 
0.92,* —3.09, 
(8) (16) 
Weak-Friendly 


(14) (13) 

2.16 2.04 

(26) (27) 
—2.66 —2.45 


(14) (13) 
Weak-Hostility 
2.00.4 —0.84, 
(26) 
Strong-F riendly 
3.26, 0.90, 


2. Wate —1.24, 








* A positive score throughout signifies an increase in unfriendliness toward P. 
* Cells containing the same subscript are not significantly different from each other (at the .05 level) by two-tailed ¢ test. 


cation (t, for the Unfavorable Evaluation and 
Favorable Evaluation conditions are 4.14 and 
3.34, respectively, both significant at p < .01).* 
The differences in attitude change were not as 
great in the case of the Ss who decreased in 
hostility toward P from Q1 to Q2, but again 
the Ss receiving the friendly notes tended to 
become somewhat friendlier (4, = 1.75 and 
1.23, respectively; the first difference is sig- 
nificant at the .10 level). These differences, it 
will be noted, are consistent with the assump- 
tion that attitude changes in keeping with the 
majority reaction to the messages denote a 
stronger expectation than changes deviating 
from the majority reaction. The degree of 
attitude change toward P is likely to be asso- 
ciated with the strength of the opinion S 
formed regarding P. Those who changed more 
may well have formed a more definite hypothe- 


SIS 


Reactions to P’s second acts. Generally, the 
Q2-Q3 attitude changes resulting from P’s 
evaluations of S follow the same pattern as 
the results of the preceding study. The small- 
est shifts in hostility toward P, whether toward 


* The / tests (two tailed) conducted in Experiment I 
had their error terms based upon the “residual mean 
square” from a preliminary “among groups” analysis 
of variance. The tests of condition differences in Ex- 
periment II utilized the Duncan Multiple Range pro- 
cedure (Duncan, 1955), which could be carried out be- 
cause of the equal cell frequencies in this latter study. 


increased friendliness or unfriendliness, oc- 
curred in those conditions where the nature of 
the evaluations received from P was consistent 
with the inferred strong expectations developed 
from his earlier act. Thus, definitely expecting 
hostility from P, the Ss in Condition A(a) 
showed a reliably smaller increase in unfriend- 
liness after P’s unfavorable evaluation of them 
than the Ss in Condition C(b) who presum- 
ably strongly expected a friendlier act from 
him (¢ = 4.59; p = .001). They also exhibited 
a somewhat (but not significantly) smaller 
increase in unfriendliness toward P than the 
Ss in Condition C(a) who had also received 
the unfavorable evaluation from P but who 
had only tentatively expected hostility from 
him (¢ = 1.83; p = .10). This change in Con- 
dition A(a), however, was significantly less 
than that for the Ss in Condition A(b) who 
presumably had a weak hypothesis of friend- 
liness from P (¢ = 2.48; p = .02). 
Essentially the same pattern was displayed 
by the Ss receiving the friendlier favorable 
evaluations from P. This action probably was 
consistent with the definite hypothesis held 
by the Ss in Condition D(b), and they were 
much less likely to change toward friendliness 
following this favorable evaluation than the 
Ss strongly expecting hostility (i.e., those in 
Condition B(a); ¢ = 4.18; p = .001). The 
B(b) Ss, only tentatively expecting friendli- 
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TABLE 2 
MEAN LEVEL OF UNFRIENDLINESS TOWARDS P ON THE 
ApyectiveE CuHeck List ADMINISTERED AT 
THE END OF EXPERIMENT I 
(Square-Root Transformation) 


| Initial Hostility | No, aitial Hos- 


Unfav- 
orable 
Evalu- 
ation 
(A) 


Favor- | Unfav- | Favor- 
able | orable |_ able 
Evalu- | Evalu- |Evalua- 
} -_ } | ton 


Ss who: 

(a) Increased in un- | (32) 
friendliness from Q1 | 5.86 
to Q2 

(8) | 


(b) Decreased in un- 


3.61 | 4.59 | 


friendliness from Q1 | 5.44 


to Q2 


ness from P, apparently were more pleased 
by P’s favorable evaluation of them than the 
former (D[b]) Ss who supposedly had a strong 
hypothesis of friendliness from P (¢ = 3.63; 
p = 01). Similarly, Ss in Condition D(a), 
tentatively expecting hostility from P, also 
became friendlier toward him upon receiving 
this pleasant act than did the D(b) Ss (¢ = 
2.80; p = .01). 

To summarize, these results clearly are in 
accord with the predictions developed on the 
basis of the earlier investigation. Ss receiving 
an act from P consistent with their presumed 
strongly held hypotheses about him showed 
relatively little change in attitude toward him. 
By and large, the greatest change in their 
feelings toward P was savwn by Ss who re- 
ceived acts that deviated from these definite 
hypotheses. 

Cumulative effects of frustration-sequences. 
The data presented up to this point have re- 
ferred to attitude changes. In actuality, of 
course, the Ss’ responses were in the form of 
absolute scores on the three-item rating scale. 
Inspection of the scores on the final question- 
naire tends to confirm the Frustration-A ggres- 
sion hypothesis regarding the cumulative 
effects of repeated frustrations. The Ss in 
Condition A, who had received two unfriendly 
acts from P, had the highest level of hostility 
towards him by the end of the experiment, 
although these scores were not reliably greater 
than those in Condition C. 

Similar results were found with the adjective 
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check-list administered at the end of the ex- 
perimental session, as can be seen in Table 2. 
The Ss in Condition A(a), who had perceived 
two clear attacks upon them, described the 
attacker in a significantly more unfavorable 
manner than the Ss perceiving only one definite 
attack. (Comparing the former condition with 
B(a), ¢ = 4.94, p < .01, and with Conditions 
C(a) and C(b), it is 3.78, p < .01, and 4.38, 
p < .01, respectively.) 

Nevertheless, it is clear from the previous 
analysis that this final level of hostility in 
Condition A(a) might not have increased 
much further with additional frustrations. S 
conceivably could learn to expect the frus- 
trations as they are repeated so that their ef- 
fect is lessened. Hcwever, this is not to suggest 
that a long series of frustrations would neces- 
sarily produce ever-diminishing reactions with 
the development of these expectations. The 
person may reach a “breaking point” after 
continually failing to reach his goals, for 
example, as McClelland suggests (1951), when 
he fears he cannot at all cope with the situa- 
tion confronting him. His reaction then could 
well be a great surge of anger toward the 
frustrator. 


EXPERIMENT II 


The next experiment was designed to test 
the possibility that contrast effects in judg- 
ments of behavior are less likely to occur when 
S has a very general hypothesis about P. 
Conceivably, the difference between the fore- 
going results and the Asch-Kelley-Luchins 
findings might stem from differences in 
hypothesis-width. The Ss in the latter inves- 
tigations formed general impressions of the 
stimulus person, while Ss’ initial expectations 
concerning P in Experiment I might have been 
very restricted in scope, pertaining more 
directly to P’s probable next act rather than 
to what he was like in general. If so, broadening 
this hypothesis to cover P as a person might 
result in an assimilation effect in which P’s 
second act is interpreted in a manner con- 
sistent with the hypothesis. 


Method 


Procedure. Male and female college students who 
had volunteered to serve in an experiment on “person- 
ality factors in the ability to assess others’ personalities” 
first participated in a group testing session in which they 
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TABLE 3 
MEAN CHANGES IN LEVEL OF UNFRIENDLINESS TOWARDS P ON THE ADJECTIVE CuEcK Lists 
Experiment IT 








Hostile Expectation 


Friendly Expectation 





Hostile Note 


Friendly Note Hostile Note Friendly Note 





Impression: Absolute score on 
CL-1 
Change after expectation created 
Change after note 


5.6 9.0 8.3 6.2 


9.3" 8.3 
—1.3,° 


—2.7 
4.3, 


—2.7 


—5.45 0.44,. 





Note.—N = 9 in each condition. 


* In all cases a positive score signifies an increase in unfriendliness toward P, while a negative score denotes a decrease in this attitude. 
* Cells containing the same subscript are not significantly different from each other (at the .05 level) by Duncan Multiple Range Test. 


responded to a personality inventory. Some time later 
two Ss of the same sex, but who did not know each 
other, were scheduled for a given laboratory hour. When 
they arrived they were reminded of the ostensible pur- 
pose of the study and then ushered into separate rooms. 
There, each S indicated his first opinions of the other 
person (P) on a 33-item adjective check-list (CL-1), 
supposedly in order to demonstrate the accuracy of his 
first impressions, but actually to provide a baseline 
level of friendliness toward P. 

Ss were then given instructions designed to create 
initial expectations regarding P. After saying he would 
give them more information about their partner, Z 
showed each S the partner’s “personality profile” sup- 
posedly prepared by professional psychologists on the 
basis of the earlier test results. This “profile” consisted 
of graphic ratings on five bipolar traits. On four of these 
traits (for example, “introversion,” “self-insight,” etc.), 
the ratings indicated P was in the “normal” or “aver- 
age” range. The fifth rating was varied with the experi- 
mental condition. Half of the Ss were led to believe P 
was “extremely friendly” (Friendliness Expectation), 
while the others were shown that he was “very hostile” 
(Hostility Expectation). Once EZ was sure that the 
ratings were understood he gave S an alternate form of 
the adjective check-list (CL-2), saying he was to evalu- 
ate the partner “on the basis of all the information 
he now possessed.” ‘The scores obtained at this time 
were used to test the success of the manipulation of 
expectation. 

In the last part of the experiment S and his partner 
were to write a note to each other in which they were 
to talk about themselves. The Ss were told the note 
from P would provide them with further information 
about P. In actuality, however, they were given the 
previously prepared communications employed to ma- 
nipulate attitudes in the preceding study. Half of the 
Ss in each of the above conditions received the message, 
now combined on one card, used to increase unfriendli- 
ness (Hostility Note), and the remaining Ss received 
the friendliness-increasing communication (Friendly 
Note). 

After the notes were written, “exchanged” and read, 
each S rated his partner again on an adjective check- 
list (CL-3) that was composed of both alternate forms 
employed before. Four of the nine Ss in each experi- 
mental condition were males. 

Dependent variables. As in the first experiment, the 


unfriendliness score obtained from the check-list was 
computed by summing the number of unfavorable 
traits attributed to P with the number of good charac- 
teristics he was said not to possess. In testing the effect 
of the expectancy-creating statements, the CL-1 score 
for each S was subtracted from his CL-2 score. The 
final score, denoting the change in unfriendliness created 
by the communication from P, was obtained by con- 
sidering only those adjectives in the final check-list 
that had been used in CL-2. The CL-2 score was sub- 
tracted from this score to yield the measure of the reac- 
tion to P’s behavior. 


Results 


The statistical findings obtained with the 
adjective check-list measures of unfriendliness 
are summarized in Table 3. Analysis of vari- 
ance of the change scores following the expec- 
tation manipulation indicate that this treat- 
ment was highly successful. Ss given the hostile 
expectation increased considerably in the un- 
favorableness of their descriptions of P, while 
those given the friendly expectation tended to 
develop a somewhat more favorable image of 
him. (This difference is significant at p < .01). 

The reactions to the communication from 
P, shown in the last row of the table, generally 
were consistent with the results of the pre- 
ceding study. The hostile note created the 
greatest increase in unfriendliness when it was 
sent to the Ss having the friendly expectation, 
while there was the greatest decrease in un- 
friendliness following the friendly note when 
the message was received by Ss presumably 
not expecting this type of behavior. 

The implication of these findings is clear. 
It is unlikely that the previous results can be 
attributed to narrow hypotheses restricted to 
P’s probable next act. The contrast effects 
consistently obtained in these studies ap- 
parently arise because the second (or the to-be- 
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evaluated) actions differ greatly from the 
actions expected by S. 
DISCUSSION 

The findings of these experiments can 
readily be understood as instances of judg- 
mental phenomena. As is well known, when a 
person makes a judgment he essentially com- 
pares the stimulus with some standard, and a 
contrast effect occurs in this process to the 
extent that the stimulus differs from the 
standard (Berkowitz, 1960b). That is, S 
evaluates the difference as being greater than 
it really is (as defined, for example, by an ob- 
server not affected by the standard). In the 
present case, S’s expectation or hypothesis 
concerning P can serve as the standard against 
which P’s subsequent behavior is evaluated. 
When P’s action is altogether inconsistent 
with S’s hypothesis there apparently is an 
analogous contrast effect, and the action has a 
considerable impact upon S’s final attitude 
toward P. This judgmental analysis suggests, 
furthermore, that it may not be necessary to 
differentiate strong from weak hypotheses, a 
distinction, it will be recalled, introduced in 


Experiment I in connection with the corre- 


spondence between S’s and the majority 
reaction to P’s initial communication. For 
example, S was said to have only a tentative 
expectation concerning P if his attitude change 
toward P following this communication dif- 
fered in direction from that exhibited by the 
majority of Ss in his condition. Subsequent 
behavioral deviations from these presumably 
weak hypotheses did not have the same effect 
generally as deviations from supposedly 
stronger hypotheses. However, these perhaps 
unparsimonious interpretations concerning hy- 
pothesis strength can be avoided by making 
a somewhat different set of assumptions. 
Here we assume that the unfavorableness of 
S’s initial hypothesis about P can be coordi- 
nated directly to the changes in his question- 
naire ratings of P from Qi to Q2. Thus, the 
greater the change in an unfriendly direction, 
the more unfavorable is S’s hypothesis. If 
P’s second action be given an arbitrarily 
maximum score (say, 10) if it is aggressive in 
tone, and a score of 0 if it is friendly, the 
magnitude of the difference between this 
initial hypothesis and P’s subsequent behavior 
can then be computed. Extrapolating from 
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judgmental research showing that the extent 
of the contrast effect is in direct ratio to the 
discrepancy between the stimulus and the 
standard, the greater the difference between 
the initial hypothesis and the later action, the 
greater should be S’s attitude change toward 
P following this action. Testing the prediction 
via the condition means shown in Table 1, 
the obtained correlation (tau) is +.71, which 
is significant at approximately the .03 level. 
The analogy therefore seems to hold. Unex- 
pected behaviors may well be evaluated as 
being even more discrepant from the hypothe- 
sis than they really are, with the degree of the 
enhanced discrepancy a function of the degree 
of unexpectedness. 

This reasoning has a direct bearing upon the 
hypotheses advanced by Dollard and his 
colleagues. Instead of supporting the Pastore- 
Bateson contention that it is necessary to 
distinguish among classes of frustrations be- 
cause those that are unexpected are more 
likely than anticipated deprivations to pro- 
duce aggressive reactions, the present findings 
can easily be incorporated into the Yale 
framework. According to Dollard, Doob, et al. 
(1939), the strength of the instigation to ag- 
gression is a positive function of the degree 
of frustration. We now can say that as a result 
of the contrast effect, unexpected frustrations 
are more likely to give rise to hostile responses 
because they probably are evaluated as being 
more severe than expected frustrations. In 
other words, there is a greater perceived inter- 
ference with S’s goal attainment if the frustra- 
tion is not expected than if it is anticipated 
because of the operation of judgmental proc- 
esses upon the evaluation of the frustrating 
event. 

The present analysis also questions the 
possibility raised by Miller and Campbell 
(1959) that there may be a “general primacy 
principle” which, at times, is masked by 
recency effects. The occurrence of primacy or 
recency may depend, in part at least, upon the 
perceived discrepancy between the first and 
later-presented information. The first material 
presumably gives rise to an hypothesis. In a 
sense, this expectation serves as the judgmental 
standard against which the later information 
is compared. If the later material appears con- 
sistent with the hypothesis (i.e., is close to the 
anchorage-expectation) it is incorporated into 
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the hypothesis, and a primacy effect is said to 
take place; the first information has had 
greater weight than the later information in 
influencing the final evaluation of the stimulus 
person, situation, or issue. On the other hand, 
if the later material is seen as deviating con- 
siderably from the hypothesis (and assuming 
it is not rejected), there is a contrast effect 
exaggerating the difference between the two 
sets of information. In this case, the later 
material may have a greater impact upon S’s 
final attitude toward the stimulus object, not 
only because it is fresher in his memory, as 
Miller and Campbell point out, but because of 
the contrast-enhanced evaluation of the ma- 
terial. 
SUMMARY 

Two experiments were devised to test pre- 
dictions developed on the basis of an earlier 
investigation (Berkowitz, 1960a). In this study 
employing pairs of Ss it was found that the 
partner’s act had its greatest effect on S’s 
final attitude toward him when it was incon- 
sistent with the partner’s earlier behavior. 
For example, if the second act was relatively 
aggressive in manner it provoked the greatest 
increase in unfriendliness toward the partner 
(P) when it had been preceded by friendlier 
behavior. The first experiment reported here, 
employing a procedure virtually identical to 
that used in the previous study, obtained es- 
sentially similar results. 

The second experiment, yielding the same 
pattern of findings, showed that these results 
cannot be attributed to “narrow” hypotheses 
on the part of Ss in the first study, restricted 
to P’s probable next act. 

These contrast effects appear to differ from 
other findings in the area of “person percep- 
tion”. The discrepancies are interpreted in 
terms of a judgmental analysis. This judg- 
mental analysis was also employed to make 
the present findings consistent with the frus- 
tration-aggression hypotheses advanced by 
Dollard, Doob, et al. (1939). The present 
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paper suggests that unexpected frustrations 
produce a stronger aggressive reaction than 
anticipated frustrations because, as a result 
of the hypothesized contrast effect, the former 
probably are evaluated as being more severe. 
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FRUSTRATION, AGGRESSION, AND MURDER! 


STUART PALMER 
University of New Hampshire 


N THE study reported here, 51 male murderers 
I and their 51 nearest-age brothers were investi- 
gated with respect to frustration in infancy, child- 
hood, and adolescence as an influencing variable 
(cf. Dollard, Doob, Miller, Mowrer, Sears, Ford, 
Hovland, & Sollenberger, 1939). Frustration was 
defined as lack of satisfaction of an innate or 
learned need. A murderer was defined as a person 
who has been legally convicted of first or of second 
degree murder or of first degree (non-negligent) 
manslaughter. The central hypothesis was: There 
is a significant positive relationship between the 
amount of frustration experienced by persons in 
infancy, childhood, and adolescence, on the one 
hand, and whether or not they later commit mur- 
der, on the other hand. Aggression release during 
the preadult years was considered a major control 
variable. 


METHOD 


The records of all males serving prison sentences 
for murder or first degree manslaughter in Massa- 
chusetts, New Hampshire, Maine, and Rhode 
Island during the fall of 1957 were investigated. 
In order to be used as cases in this study, it was 
necessary that, as well as having brothers, the mur- 
derers have mothers who were available for inter- 
viewing. Of 254 imprisoned men, 76 met the follow- 
ing Criteria: 1. under 51 years of age; 2. court 
records indicated that the subject was believed to 
have physically committed the murder; 3. at 
least one brother who had attained the age the 
murderer was when he committed murder; 4. 
mother living; 5. mother able to answer questions 
in English; and 6. mother residing within 150 
miles of Durham, New Hampshire. Twenty-five of 
the 76 mothers were not ava lable for interviewing, 
leaving 51 Ss in the muraerer group. Where a 
murderer had more than one brother who fitted 
Criterion 3 above, the brother nearest the murderer 
in age was chosen, to yield a control group of the 
same size. None of these control brothers was 
known to have committed murder. Use of the 
brothers as control Ss made it possible to hold rela- 
tively constant such factors as age, sex, social 
class of parents, religion, and the like. 


1 Supported in part by four grants from the Central 
University Research Fund of the University of New 
Hampshire. 


Some data, such as intelligence quotients, were 
obtained from the prison records of the murderers. 
Most were obtained through interviewing the 
mothers with the use of a questionnaire composed 
of 143 items, a majority of which were close-ended. 
Each mother completed the questionnaire twice, 
once for the murderer son, once for the non- 
murderer. The order of questioning about sons was 
reversed with alternate mothers. 

Indices of Physical Frustration, Psychological 
Frustration, General Frustration, Socially Ac- 
ceptable Aggression Release, Socially Unacceptable 
Aggression Release, and General Aggression Re- 
lease were developed. In the Index of Physical 
Frustration, the S was allotted one point for the 


following: 


1. A difficult birth 

2. Serious effects of forceps at birth 

3. Each serious operation 

4. Each serious illness 

5. Each serious accident 

6. Each serious beating by someone other than a 
parent or step-parent 

7. One or more serious beatings by the mother; and 

8. One or more serious beatings by the father or 
step-father; with respect to this index as well as to 
those below, words such as difficult and serious were 
operationally defined 


In the Index of Psychological Frustration, one 
point was allotted the S for the following: 


1. Each serious, physical defect that could give rise 
to social disapproval 

2. Very rigid time schedule adhered to by the 
mothers in caring for the child during his first two 
years 

3. Bottle-feeding during some portion of the first 
two months of life 

4. Fixed schedule followed by mother in feeding the 
child during the first two months 

5. Final weaning started before age 12 months 

6. Toilet-training begun before age 12 months 

7. Sexual] seli-play stopped by mother before age 12 
months 

8. Mother responded to child very often by crying 
during first five years of his life 

9. Mother responded to child very often by isolating 
herself during first five years of his life 

10. Each trauma due to natural events or other indi- 
viduals, excluding beatings 

11. Great difficulty learning to read or never learned 
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12. Persistent bed wetting 
13. Persistent stuttering; and 
14. Spent a majority of time playing alone at age five 


An S’s score on the Index of General Frustration 
was found by adding together his scores on the 
Index of Physical Frustration and the Index of 
Psychological Frustration. 

In the Index of Socially Acceptable Aggression 
Release, one point was allotted the S for the fol- 
lowing: 


1. Verbal loudness at age five classified as loud or 
average 

2. Anger short of temper tantrums at age five once 
a week or more 

3. Reasonable verbal release at age five which oc- 
curred half or more of the times when angry 

4. Physical fighting at age five once a week or more 

5. “Talking back” to the mother at age five occa- 
sionally or more 

6. Dirtying clothes at age five, half or more of the 
times when “dressed up” 

7. Participation in athletics at age 17 occasionally 
or more 

8. Hunting at age 17 occasionally or more; and 

9. Mildly antagonizing others by verbal means at 
age 17 occasionally or nore 


In the Index of Socially Unacceptable Aggres- 
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sion Release, one point was allotted the S§ for the 
following: 


1. Lying to the mother at age five occasionally or 
more 

2. Stealing at age five occasionally or more 

3. Swearing at age five occasionally or more 

4. Temper tantrums once or more during childhood 

5. Deliberately and seriously hurting animals once 
or more during childhood or adolescence 

6. Fist-fighting once or more at age 17 

7. Violent aggression other than fist-fighting when 
intoxicated, before age 20 


Whether a behavioral form indicated socially ac- 
ceptable or unacceptable aggression release was 
determined arbitrarily by the investigator. An S’s 
score on the Index of General Aggression Release 
was found by summing his scores on the Index of 
Socially Acceptable Aggression Release and the 
Index of Socially Unacceptable Aggression Release. 


RESULTS 


As Table 1 shows, significantly more murderers 
than control brothers scored high on the Indices of 
Physical, Psychological, and General Frustration. 
Further, when General Frustration scores within 
pairs of murderers and brothers were compared, it 
was found that in 42 pairs (82%) the murderer had 


TABLE 1 
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a higher score than his brother. In five pairs the 
scores were equal, and in four, the brother’s score 
was higher. 

The data in Table 1 also show that a signifi- 
cantly greater number of control brothers than of 
murderers had high scores on the Index of Ac- 
ceptable Aggression Release. Conversely, a signifi- 
cantly greater number of murderers than of 
brothers had high scores on the Index of Unac- 
ceptable Aggression Release. On the Index of Gen- 
eral Aggression Release, scores were approximately 
the same for the murderers as for the brothers (in 
spite of the fact that the Chi square value was sig- 
nificant at the .05 level). 

Although detailed data are not included here, 
the relationship between general frustration and 
murder was investigated while holding general ag- 
gression release approximately constant. This was 
done by dichotomizing the total group of 102 sub- 
jects into two equal sub-groups; one, composed of 
51 murderers and brothers who used general ag- 
gression releases extensively (score of 4 or more on 
the Index of General Aggression Release), and the 
other composed of 51 murderers and brothers who 
used general aggression releases infrequently (score 
of three or less). Frustration was positively asso- 
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ciated with murder whether aggression releases 
were used extensively or infrequently. However, 
the association was statistically significant only 
when aggression releases were employed extensively. 


DISCUSSION 


Clearly, there are three especially major draw- 
backs in this study. First, the mothers may not 
have reported the facts correctly. Second, the group 
of murderers is probably not representative of any 
larger murderer population. Third, the indices of 
frustration and aggression release are crude; they 
do not take into account the fact that one incident 
of frustration or aggression release may not equal 
the weight of another such incident. However, as 
an exploratory study where more satisfactory data 
are difficult to obtain, it indicates the possibility of 
a functional relationship between preadult frus- 
tration and later presence or absence of murder. 
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SOME PHYSIOLOGICAL CORRELATES OF RORSCHACH BODY IMAGE 
PRODUCTIONS! 


A. D. DAVIS 
University of Texas 


ECENT investigations have given rise to the 
R contention that some aspect of personality 
organization reflected in Rorschach percepts is in- 
volved in the selection of psychosomatic symptoma- 
tology (Cleveland & Fisher, 1954; Fisher & Cleve- 
land, 1955, 1956, 1957). This personality variable is 


1 This article is based on a dissertation submitted to 
the faculty of the department of psychology of the Uni- 
versity of Texas in partial fulfillment of the require- 
ments for the PhD degree. The author is indebted to 
Philip Worchel, Seymour Fisher, and Sidney E. Cleve- 
land for their support, aid, and supervision in this 
study. 

2 Now at the Department of Psychology, Western 
State Hospital, Ft. Steilacoom, Washington. 


concerned with the individual’s perception of his 
body exterior as having (or lacking) protective 
characteristics. The relative strength of this “body- 
image perception,” according to Fisher and Cleve- 
land (1957), can be quantified as a “barrier score” 
by an appropriate scoring procedure. This barrier 
score has been related to the projective perform- 
ance of clinical groups manifesting specific psycho- 
somatic symptom complexes, and it has been 
argued that these bodily perceptions predispose 
normal individuals to symptoms at either an 
“interior” or “exterior” bodily site through 
heightened physiological response tendencies 
(Fisher & Cleveland, 1955, 1957). 

This formulation suggests that the greater 
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number of Rorschach responses which emphasize 
“barrier” (impermeable, enclosing, or otherwise 
protective) surface qualities, the greater is the 
physiological reactivity. toward the periphery of 
the body (skin and musculature). Conversely, the 
relative absence of these productions is related to 
increased physiological reactivity at an interior 
body site. The present study was formulated as 
a means of investigating this hypothesized rela- 
tionship in normal subjects. 


METHOD 
Subjects 


Group Rorschachs were administered to three 
sophomore psychology classes totaling 280 students 
at the University of Houston. Of these Ss, only 
the 146 males were utilized because of the nature 
of the physiological recording. From Ss having 
more than the median number of barrier responses, 
25 were randomly selected and designated as the 
“high barrier” group (Group I); of Ss with less 
than the median number of barrier responses, 25 
were randomly selected and designated as the “low 
barrier” group (Group II). 


A pparatus and Physiological Measures 


Physiological indices were obtained by the 
following methods and apparatus: 


1. Electromyogram (EMG). Recordings of muscle 
action potential were obtained from two EMG elec- 
trodes fixed by a head band placed two inches apart 
over the frontalis muscles of the left eye. The muscle 
potentials were fed into an electronic current integrator 
after the design of Jacobson (1951) and this integral re- 
corded on a standard Grass ITI-C, 7 channel, ink-writ- 
ing oscillograph. 

2. Skin Resistance (GSR). A measure of skin resist- 
ance was obtained from two 2-inch sponge electrodes 
placed on the dorsal and palmar surface of the left hand. 
The basic circuit of the GSR unit utilized for recording 
is reported by Davis and Porter (1931). 

3. Blood Pressure. Systolic (SBP) and diastolic 
(DBP) blood pressures were obtained by the cuff and 
auscultatory method from the right arm. Mean blood 
pressure (MBP) was computed as the average of the 
systolic and diastolic pressures. 

4. Ballistocardiograph Variables (BCG). Four ballis- 
tocardiograph indices were utilized after the fashion of 
Funkenstein, King, & Drolette (1954): (a) An absolute 
count of pulse rate (P). (6) The mean height of the 
BCG-IJ wave was determined over the arbitrary time 
span of the IJ waves utilized to define stroke volume 
(IJ). The IJ wave-form characteristic in the BCG rec- 
ord is the most conspicuous characteristic of the record 
and it is generally agreed to be the headward ballistic 
reaction of the body to ventricular systole (Dock, Men- 
delbaum, & Mendelbaum, 1953). This variable has been 
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empirically related to ventricular stroke volume by 
Starr, Schnabel, Mayock, & Austin, 1952). (c) The 
average height of the IJ unit was multiplied by the pulse 
rate over unit time (IJ X P). This measure was em- 
pirically related to cardiac output by Nickerson, War- 
ren, & Brannon, 1947). As outlined by Funkenstein 
(1954), these variables were utilized without the em- 
pirically derived constants of Starr and Nickerson 
(1952) as purely relative physiological indices with no 
interpretation of absolute physiological meaning or 
value intended. (d) The mean blood pressure divided 
by the average height of the IJ waves multiplied by 
pulse defined peripheral resistance (MBP/IJ X P). 
This measure has similarly been empirically related to 
the total peripheral resistance of the cardiovascular 
system. 


The instrument used to obtain these records was 
a critically damped accelerometer consisting of a 
two-gram weightgpendulum which was suspended 
from a magnetic phonograph cartridge mounted in 
a block of wood 18 X 4 X 2 inches. The pendulum 
weight was immersed in a glycerin bath which pre- 
vented reverberation of the pendulum, thus giving 
clean records of the body ballistics. The accelero- 
meter was designed to give identical wave form 
characteristics to that of the Virgil Hill Electro- 
magnetic BCG (Dock, Mendelbaum, & Mendel- 
baum, 1953). The apparatus was placed across the 
legs at the ankles and recorded on the Grass 
oscillograph. 

From these measures, two groups of three vari- 
ables each were assigned to exterior and interior 
classifications. The rationale for this division was 
dependent upon the bodily site at which the 
greatest variation of the measures occurred. The 
exterior variables and the physiological process to 
which they are related were as follows: (a) muscle 
potential (EMG), (6) skin resistance (GSR), and 
(c) peripheral resistance (MBP/IJ XP). 

The interior measures included (a) pulse rate 
(P), (6) stroke volume (IJ), and (c) cardiac output 
(IJ XP). Systolic and diastolic blood pressures 
(SBP) and (DBP) were obtained in order to 
compute the mean blood pressure (MBP). 

According to the formulation, Group I (high 
barrier) individuals should show greater respon- 
sivity in the exterior group of variables, and Group 
II should reflect relatively greater physiological 
response on the measures designated as interior. 


Procedure 


Each S was instructed to lie down on a one-half 
inch horsehair pad on the floor. A sandbag was 
placed beneath his heels to lock his knees for the 
ballistocardiograph recording, and the remainder 
of the equipment was affixed while the investigator 
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casually engaged him in conversation about 
neutral topics. S was then instructed to relax and 
rest while the investigator obtained “recordings 
of his heart.” An initial rest period of 20 min. was 
given in an attempt to establish conditions which 
were as basal as possible for resting-level measures. 
During the last 10 min. of the rest period, GSR 
readings were recorded, and the blood pressure was 
sampled three times. At the end of the rest period, 
a 2-min. sample recording of EMG and BCG was 
obtained and utilized as rest level measures. At 
this time, another sample of GSR and blood pres- 
sure was taken and averaged with the initial meas- 
ures as an estimate of resting level. 

Six arithmetic problems from the Wechsler 
Intelligence scale were read from cards. S was 
given only 30 sec. for each solution. The first two 
answers were accepted as correct and the last four 
as incorrect, regardless of their nature. The prob- 
lems were read readily, and the investigator’s 
standard comments after each “failure” were 
designed to be increasingly more depreciating. The 
GSR reading was noted after presentation of each 
problem, and the maximum fall in resistance 
during the problem period was considered as the 
maximum stress response on that variable. After 
presentation of the last problem, blood pressure 
was immediately taken, and a sample recording of 
BCG and EMG was obtained. These measures 


were utilized as the poststress (or response to 
stress) measures. 


RESULTS 


The scores of the combined groups on each 
variable were linearly transformed into distribu- 
tions having 2 mean of 50 and a standard deviation 
of 10, making it possible to compare the individ- 
ual’s performance on any variable with his per- 
formance on any other relative to the sample 
mean of the variable. As a means of evaluating the 
differences in the groups’ resting levels, ¢ and F 
tests of significance were performed between groups 
on each of the variables in the pre-stress condition. 
The significant data are presented in Table 1. 
Marked differences occurred between the two 
groups on their resting measures. Group I was 
significantly higher on one exterior measure 
(EMG) at the .02 level with a one-tailed test. The 
Group II resting response level, as predicted, 
tended to be higher than that of Group I on the 
interior measure of P. However, the outcomes for 
Ij and IJ X P were reversed with Group I having 
the higher mean. Interestingly, the observed blood 
pressure measure, SBP and MBP, approached 
significance in favor of Group II and appeared to 
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TABLE 1 


TRANSFORMED PREsTRESS MEANS, STANDARD 
DEVIATIONS, AND SIGNIFICANCE 
or DIFFERENCES 








Group I Group II 





Mean; SD 


Mean; SD 


53.06 46.94 


48.37 


9.50 
9.64 
9.10 





























have greater variance than any other Group II 
performance. Similarly, Group I variance on the 
blood pressure measures was among the smallest 
of that group’s performances. Inasmuch as the 
standard deviation for the distribution of the 
groups combined is equal to 10, discrepancies from 
this value are some relative indication of the 
lability of the physiological variable for the group. 

To make comparisons between the groups on 
their relative physiological reactivity to stress, pre- 
and postdifference scores were corrected for 
differences in initial resting level for a regression 
analysis. The procedure was recommended by 
Lacey (1956), who pointed out that the high 
correlation between resting and stress levels pre- 
disposes percentage or absolute-change data 
toward reflecting initial differences. The result of 
this treatment yielded a distribution of “Autono- 
mic Lability Scores” free of dependence on base 
level and expressed in units which make it possible 
to compare relative responsivity on a number of 
different physiological variables. 

Table 2 summarizes the data for the Autonomic 
Lability Scores. It is apparent that Group I re- 
sponded to the stress situation in terms of the 
exterior variables with greater EMG, GSR, and 
MBP/IJ XP values than those of Group II. Al- 
though the GSR difference was not significant, the 
difference was in the predicted direction. With a 
one-tailed test, EMG and MBP/IJXP may be 
considered as significant at the .01 and .06 levels 





CRITIQUE AND NOTES 


TABLE 2 


Autonomic Lasirry Score MEANS, STANDARD 
DEVIATIONS, AND SIGNIFICANCE OF DIFFERENCES 
BETWEEN GROUPS 








Group I Group II 





Mean| SD 


Mean; SD 


j 
9.60 | 6.58 
5.74 
9.65 


9.34 


8.58 


45.89 | 9.20 |S3.73 | 9.26 | 7.64 
48.66 |10.84 [51.27 | 8.54 | 2.61 
46.30 |10.15 [53.98 | 7.72 | 7.68 


* Two-tailed test due to significant F ratio. 





























TABLE 3 


RANK ORDER OF RESPONSIVITY AND COEFFICIENTS OF 
THE Two Groups On EAcu OF THE EXTERIOR AND 
INTERIOR VARIABLE IN PRE- AND POSTSTRESS 

anp Autonomic LaBritiTy MEASURES 


Prepost Auto- 
nomic Lability 
Scores Ranks 





Group 
I 


Gro Grow 
Group I n Group I Il ° 





MBP/IJ xX P 


Interior 
P 
VJ 
Ij xP 


rho:* —1.00 —.% —.83 
Standard error - ll 133 





* rho of .829 needed at .05 level of significance; .943 needed at 
#1 level. 


respectively. Conversely, Group II responded in 
terms of the interior variables with P (¢ of 2.03), 
IJ (¢ of 2.17), and IJXP (2.07) all significant 
beyond the .02 level. A one-tailed test was not 
reported on the IJ variable because of the signifi- 
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cant difference in variances. The blood pressure 
measures, SBP and MBP, as in the resting level, 
were greater (.01) in Group II. 

With the raw data transformed in such a way 
as to permit the relative order of responsivity 
across variables to be determined, the mean rank 
of each group on each variable was found and 
rank order (rho) correlations computed. These 
ranks and their rho coefficients for the prestress, 
poststress, and pre- and poststress regression 
analyses (Autonomic Lability Scores) are pre- 
sented in Table 3. As the table suggests, there was 
a marked contrast in the form of a high inverse 
relationship. There was a perfect negative correla- 
tion between the order of Group I and Group II 
in the resting level; for the poststress and difference 
analyses, the rhos of —.86 and —.83 respectively 
were both significant beyond the .05 level. 


DISCUSSION 


The results indicate that the differences at 
resting level between the groups are not com- 
pletely in accord with the hypotheses. For Group I 
(high barrier), however, the EMG measure is 
greater than that of the Group II (low barrier) 
measures. Similarly, for Group II, P is greater than 
in Group I with no evidence of differences between 
the groups on GSR, MBP/IJ xP, or IJ XP. It 
will be noticed that IJ xP (cardiac output) enters 
into both of the nonsignificant cardiac variables. 
It appears from inspection of the underlying 
variables (P and IJ) that the cardiac output, 
although not different, may be achieved in different 
ways during the resting measures. In Group I, it 
seems to be primarily a function of significantly 
higher stroke volume (IJ) and slow pulse; whereas 
in Group II, the cardiac output appears to result 
from low stroke volume and a significantly more 
rapid pulse. Additionally, the pre- to poststress 
means suggest that all cardiac activity in Group I 
decreases from rest to stress. This is undoubtedly 
related to the increase in this group’s peripheral 
resistance, which would tend to impede any cardiac 
output. Nevertheless, cardiac activity in Group II 
seems excessive under stress in comparison to the 
increase shown by Group I Ss. On the other hand, 
the cardiac organization of Group I appears to 
becomes more efficient under stress, possibly 
through a slowing of the pulse, allowing greater 
ventricular filling before systole. These observa- 
tions reflect highly significant differences in the 
organization of the two groups’ cardiovascular 
functioning. Such characteristic differences in 
cardiovascular organization across individuals 
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were similarly reported by Lacey (1950, 1953) 
with Ss under stress. 

To summarize the resulting physiological 
hievarchies of response of the two groups under 
stress: Group I (high barrier) reacts in terms of 
increased muscle potential and peripheral vaso- 
constriction, a decreased skin resistance, increased 
stroke volume of the heart, increased cardiac out- 
put, and increased pulse rate in that order. The 
Group II (low barrier) hierarchy of response under 
stress consists of an increase of stroke-volume, 
pulse rate, and cardiac output, and a decrease in 
GSR, increased peripheral resistance, and muscle 
potential in that order. These hierarchies clearly 
demonstrate a primacy of response in the exterior 
indicies for the high barrier group and for the 
interior indices in the low barrier group when both 
groups are under stress. These data tend to lend 
support for the hypotheses of this study if the 
hypotheses are restricted to considerations of the 
stress data. The resting level data do not support 
the contention that these physiological predisposi- 
tions are characteristic of the two groups under 
basal conditions. 


SUMMARY 


The relationship between surface characteristics 
of Rorschach responses and physiological measures 
was investigated in order to test the hypothesis 
that the projected perception of one’s body is 
related to the patterning of physiological responses. 
Nine physiological measures were obtained under 
rest and experimentally induced stress from 50 Ss 
selected at random from a larger sample of 146 
normal male subjects. Physiological indices related 
to bodily exterior (muscle potential, skin resistance, 
cardiovascular peripheral resistance) and bodily 
interior (pulse rate, stroke volume, cardiac output) 
physiological responsivity were abstracted from 
the nine measures. The results indicate that the 
tendency to give impermeable or protective char- 
acteristics to Rorschach percepts is related to 
heightened physiological reactivity at an exterior 
body site and that the absence of these characteris- 
tics in Rorschach productions is related to 
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heightened physiological response on _ interior 
indices of physiological response under stress. 
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UMEROUS studies reporting correlations be- 
N tween conductance or palmar sweating 
changes and changes in anxiety stimuli or self- 
report of tension have encouraged the view that 
conductance level, from which the changes have 
been noted, represents the anxiety or tension! level 
of an individual. But studies attempting to deter- 
mine the function relating conductance levels and 
manifest anxiety are rare. Hence, it is a dubious 
procedure to infer that one individual is more 
anxious than another on the evidence that his 
conductance leveis are higher. Moreover, lacking 
knowledge of the function between anxiety and 
conductance level, it is, strictly speaking, im- 
proper to interpret changes in level as unequivocal 
evidence of changes in anxiety. 

The following study was designed to clarify this 
problem. Specifically it was planned to obtain data 
on two questions: 

1. If Individual A has the same prestimulus 
level as B, do the two have the same manifest 
anxiety level? 

2. If A has a different level than B, does the one 
with the higher conductance also have the higher 
anxiety level? 


METHOD 


Thirty-two men, all graduate students, and 
eight women, students or students’ wives, aged 21 
to 28, served as paid subjects (Ss). 

While Ss played cards, electrodes were fastened 
to the plantar surfaces of both feet. Essentially 
simultaneous recordings were taken from groups 
of eight Ss by a method described elsewhere 
(McDonnell & Carpenter, 1959). Recordings 
lasted for 15 min. for each of the two sessions 
(spaced one week apart). The individual’s mean 
conductance level for both sessions was taken as 
his prestimulus conductance level. Manifest 
anxiety level was obtained by administration of the 
Mandler-Sarason General Anxiety scale at the 
second session. 


RESULTS AND DISCUSSION 


The questions prompting this inquiry were (a) 
if A and B have the same conductance level, do 
they report the same anxiety, and (6) if B has a 
higher level than A, does he also report greater 
anxiety? 


1 The study assumes, without proof, an equivalence 
between “anxiety” and “tension.” “Anxiety” will be 
used hereafter for both. 


Answers are obtained by referring to the rela- 
tion between anxiety and conductance shown in 
Fig. 1. Each plotted point represents an average 
of the conductance level of those Ss whose anxiety 
scores were found in a given anxiety interval. The 
correlation ratio was .722 (P < .01 with 39 df). 

Thus, it appears that if A and B both have high 
conductance levels, they are apt to report moder- 
ate anxiety ; but if they have low conductance levels, 
S could report either high or low anxiety. A similar 
difficulty obtains in predicting differences in 
anxiety from the fact that B has a higher con- 
ductance than A. The higher conductance signifies 
higher anxiety only when both Ss report relatively 
low anxiety. If both report high anxiety, the 
higher conductance would be expected in the S 
with lower anxiety. 

The evidence that low conductance may be 
associated with high anxiety corresponds to two 
previous observations, Van der Valk and Groen 
(1950) found that Ss with diseases supposedly 
symptomatic of high tension (ulcerative colitis, 
peptic ulcer, diabetes, etc.) showed lower conduct- 
ance levels than a group of nondiseased norraals. 
Similarly, Mowrer, Light, Luria, & Seleny (1953) 
reported an overtly high anxious patient who had 
“phenomenally” low levels of palmar swe ~ +g. 
The idea that similar conductance levels can 
characterize Ss with different anxiety levels is 
supported by the same author, who noted that 
individuals differing widely in tension level showed 
nearly identical sweating levels. 

Greater awareness of anxiety may be accom- 
panied at first by greater autonomic activity. 
Thereafter, some process, possibly an extension of 
homeostatic restraint as conceived by Lacey 
(1956), intervenes to modulate sympathetic 
effects. The result is a lowering of conductance 
levels. 

Although this study only furnishes evidence on 
the meaning of conductance levels, the finding of a 
curvilinear function suggests caution in inter- 
preting conductance changes. Depending on S’s 
level of anxiety, a decrease in conductance could 
signify an increase in anxiety for one S and just the 
reverse for another. Illustratively, Mowrer found 
with some patients that a decline in sweating was 
accompanied by an increase in reported tension. 
Similarly, as other investigators (Van der Valk & 
Groen, 1950) have interpreted their results, an 
increase in conductance could, in some Ss, mark a 
decrease in tension. 
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REPORTED ANXIETY 


Fic. 1. The relation between skin conductance ard manifest anxiety. 


The results of this study require two qualifica- 
tions: size and nature of sample, and the validity 
of the anxiety scale. 

Sample size is important where correlation 
ratios are concerned, because small frequencies in 
the grouping intervals exaggerate the coefficient. 
Although care was taken to use gross categories, the 
number of Ss at the extremes of the distribution 
were nevertheless small. Again, the twin circum- 
stances of being voluntary and paid might have 
selectively influenced both conductance and 
anxiety score. However, the absence of satisfactory 
norms for conductance or anxiety levels makes 
identification of such influences impossible. In- 
cluding results of both sexes seemed warranted for 
two reasons: females resembled the males in range 
of both anxiety scores and conductance levels, and 
they exhibited the same curvilinear relation be- 
tween the two variables. 

While reliability of the Anxiety scale is satisfac- 
tory, validity is not so well established (Mandler & 
Sarason, 1952). The lower anxious group was 
found here to be more heterogeneous in conduct- 
ance level than the higher anxious group, possibly 
indicating that such low scores include both 
genuinely low anxious Ss and Ss who are denying 
or suppressing anxiety. It is equally possible that 
the greater variability of this group results from 
(a) weakness of discrimination in the Anxiety scale 


or (6) lack of correspondence between anxiety and 
conductance at the low ena of the anxiety contin- 
uum. The data do not permit a decision between 
these alternatives. 


SUMMARY 


This study investigated the functional rela- 
tionship between conductance levels and an index 
of manifest anxiety. A curvilinear relationship was 
found and discussed in relation to the use of 
conductance level as an index of emotionality. 
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N INVESTIGATION of 250 patients with ma- 
lignant disease (LeShan, 1956) indicated a 
pattern in 62% of this group and in 10% of the 
controls which included a psychologically trau- 
matic event early in life and a marked amount of 
disturbance in the individual’s ability to relate to 
others. A study by Reznikoff (1955) of women with 
cancer of the breast similarly revealed that a 
significantly higher proportion of cancerous women 
than controls had had a sibling die in childhood. 
It was decided to test the hypothesis that the 
cancer patient has suffered a psycholigical blow 
in childhood more often than chance would allow 
by making a specific prediction from it. The pre- 
diction is that the birth of the next youngest 
sibling occurred earlier in the lives of patients 
with cancer than in those of controls. The reasoning 
is basically simple. The birth of a younger sibling, 
with the consequent perceived loss of parental 
energy and time, is a traumatic event. Other 
things being equal, the earlier this occurs, the 
more severe the trauma. The child has had less 
time to be the baby and is younger and therefore 
less able to cope with his emotions; the younger the 
child, the more his feelings are on a preverbal 
and magical level and, therefore, more difficult 
to deal with. 


METHOD 


It was decided to choose a group of families 
which had been carefully evaluated from a medical 
standpoint, and which had at least one cancerous 
individual among the offspring. A set of pedigrees 
was chosen from Jacobsen’s (1946) extensive data.* 
He studied 200 cancer patients, treated in Copen- 
hagen hospitals, on whom very complete family 
records were available. Certain of Jacobsen’s 
families were excluded. These were families in 
which the cancerous individual was the youngest 
sibling, a twin, or an only child, and those where 
the ages of some of the siblings were missing. 

The controls were noncancerous offspring in 
the same families. It was believed that using these 


1 This paper was written under the auspices of a 
grant from the Ayer Foundation, Inc. 

* The writers wish to express their appreciation to 
the staff of the library of the American Cancer Society 
for their courtesy and help in selecting this set of 


pedigrees. 


subjects would minimize genetic factors and 
make any systematic impact of other carcinogens 
in the data quite improbable. 

As an additional test of the hypothesis, data on 
the Ss used in Reznikoff’s study (1955) were 
reviewed with specific respect to the next youngest 
sibling. The Ss in this research were 50 women 
attending the Breast Clinic of Memorial Center 
for Cancer and Allied Diseases, New York, for 
the first time. It was eventually found that 25 
of these women had malignant neoplasms, while 
the remaining half were subsequently diagnosed as 
having benign disorders. Also included in this 
study were 25 women from the Strang Detection 
Center declared to be entirely free of any breast 
pathology. The three groups were approximately 
the same in their age distributions. There were no 
group differences in average number of siblings, 
although the cancer Ss significantly less frequently 
had two or fewer sibs. The cancer and benign Ss 
were of comparable economic status, having been 
screened to insure that their incomes met the 
eligibility requirements of the Breast Clinic. 


RESULTS 


A chi square test of the frequencies in Table 1 
shows that there is a statistically significant 
difference in duration of being the youngest child 
between the cancerous individuals and the con- 
trols. Individuals with cancer tend to have a 
shorter period of being the youngest child than 
do their cancer-free siblings. It is to be noted, 
however, that the difference falls almost entirely 
within the first interval of 10-23 months, which 
encompasses only 27% of the cancer cases and 9% 
of the controls. 

As can be seen from Table 2, a significant differ- 
ence was also obtained between Reznikoff’s three 
groups, compared for age of next youngest sibling, 
using an analysis of variance design. From Table 3, 
which contrasts these three groups considered 
two at a time, it is further revealed that the 
cancer Ss were significantly closer in age to their 
next youngest sib than were the cancer-free in- 
dividuals. Neither the comparison between the 
cancer and benign disorder groups nor between 
the benign disorder and cancer-free Ss resulted in 
significant differences, although the former ap- 
proaches significance. 
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CRITIQUE AND NOTES 


TABLE 1 
FREQUENCY OF INDIVIDUALS WITH AND WITHOUT CANCER AS RELATED TO THE NUMBER OF 


MONTHS BETWEEN Own Birtu AND THAT OF THE NEXT YOUNGEST SIBLING* 














Months Between Own Birth and That of Next Youngest Sibling 





10-23 


Individuals with cancer 
Cancer-free siblings 


x* = 18.32, df = 5, > < .O1. 
* Jacobsen’s families. 


TABLE 2 

CoMPARISON OF CANCER, CANCER-FREE, AND BENIGN 
Disorper Groups wits RESPECT TO THE NUMBER 
or Montus Between Own BIRTH AND THAT OF 

THE Next YOUNGEST SIBLING 








Source of Variation a Mean Square 





Between groups 2 2558.51 
Within groups 53 


9 < 05. 


TABLE 3 
ComPaRISON OF CANCER, CANCER-FREE, AND BENIGN 
DisorpER Groups CONSIDERED Two aT A Time 
wirn REsPecT TO THE NuMBER OF MONTHS 
BETWEEN OWN BIRTH AND THAT OF NEXT 


difference 
months 


Cancer with cancer-free 

Cancer with benign dis- 
order 

Benign disorder with can- 
cer-free 





Note.—Means: cancer 26.53 months; cancer-free 49.26 months; 
and benign disorder 42.00 months. 


DISCUSSION 


A growing body of literature suggests that 
there are some correlations between personality 
organization and the occurrence of malignant 
disease. Two recent reviews (Kowal, 1955; LeShan, 
1959) show that a large diversity of studies, using 
different methods, have come to this conclusion. 
Some studies seem to show indirectly that an 
early emotional trauma is found in a significant 
percentage of individuals who later develop cancer. 
Evans (1926) states that her cancer patients, who 
were in Jungian depth analysis, often regressed to a 
primary identification with the mother. In breast 
cancer patients, Bacon, Rennecker, and Cutler 
(1952) found a great deal of unresolved tension 
with the mother-figure. Cobb (1952) stated that 
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her cancer patients regarded emotional relation- 
ships as dangerous and showed an absence of 
cathectic attachments. The papers of LeShan and 
Worthington (1956) and of Reznikoff (1955) 
point directly to an early traumatic event as 
being correlated with neoplastic disease. 

It should be made clear that there is no implica- 
tion here of a single cause of cancer. Our data 
suggest that under certain conditions, early emo- 
tional trauma may interact with other factors 
and play a part in the etiology of a later malig- 
nancy. The possible meanings of psychological 
events in the development of cancer have been 
partially explored by Meerloo (1954) and others, 
but much further research is needed to expand our 
gradually increasing awareness that psychological 
factors apparently must be viewed as being of 
some significance in the development of malignant 
disease, and to integrate this knowledge with the 
physiological findings in this field. 
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SOME RELATIONSHIPS BETWEEN GSR CONDITIONING AND 
JUDGMENTS OF ANXIETY’ 


HAROLD GILBERSTADT anp GENE DAVENPORT 
University of Minnesota and Veterans Administration Hospital, Minneapolis 


REVIOUS studies have demonstrated differences 

in conditionability between samples of indivi- 
duals grouped on the basis of anxiety. Classical con- 
ditioning procedures have been used, employing the 
eyelid reflex (Spence & Farber, 1953; Spence & 
Taylor, 1953; Taylor, 1951; Taylor & Spence, 
1954) as well as the galvanic skin response (GSR) 
(Bitterman & Holtzman, 1952; Howe, 1958; 
Lykken, 1957; Welch & Kubis, 1957). The samples 
have included both normals and psychiatric pa- 
tients. Anxiety has been assessed by self-inven- 
tories such as the Taylor Manifest Anxiety Scale 
(MAS) (Taylor, 1953) and by behavior rating 
scales such as that of Buss, Wiener, Durkee, & 
Baer (1955). 

The mechanics of conditioning procedures have 
varied, However, two publications (Moeller, 1954; 
White & Schlosberg, 1952) have presented evi- 
dence that the probability of successful condi- 
tioning is increased by using a one-half second 
interval between the onset of the conditioned (CS) 
and unconditioned stimulus (US). 

The aim of the present study was to investigate 
further the relationship between GSR conditioning 
and various clinical judgments of anxiety in a 
sample of psychiatric patients conditioned under 
presumably optimal procedures. 


METHOD 


Sample. Nineteen out of 23 consecutive admis- 
sions to the open wards of the Minneapolis VA 
Hospital constituted the sample. The four patients 
who were not used include * two who showed no 
GSR to shock or buzzer and two who had to be 
eliminated because of apparatus irregularities. The 
sample included three depressive reactions, three 
psychophysiologic reactions, seven anxiety reac- 
tions, two emotionally unstable personality reac- 
tions, and one each of paranoid state, adult situa- 
tional reaction, and incipient schizophrenic 
reaction. Mean Shipley-Hartford IQ was 112 on the 
Sines (1958) norms, with a range of 96 to 126. 
Mean age was 32 with a range of 21 to 57. 

Apparatus. The apparatus consisted of a Fels 
Dermohmeter 22A and an Esterline-Angus re- 
corder. A 2 cm. electrode was centered on each 
palm, and the discharge from a 2.0 microfarad 
condenser charged from a 75 volt source was 


1 We are indebted to J. J. Jenkins from whom we re- 
ceived helpful suggestions. 
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applied across the third phalanges of the second 
and fourth fingers of the dominant hand. 

Conditioning. Instructions were given which 
represented the investigation as a routine hospital 
test of physiological functioning. The conditioning 
procedure followed Lykken (1955, 1957) with the 
exception that a one-half second CS-US interval 
was used in place of the five-second interval. 
Adaptation consisted of ten trials in which a 
buzzer (No. 1) and a different-toned buzzer 
(No. 2) were presented intermittently but not 
followed by shock. Conditioning consisted of 
shock reinforced presentations of buzzer No. 1, 
interspersed with four nonreinforced presentations 
of buzzer No. 2. Extinction consisted of 24 pres- 
entations of the two buzzers without shock. 

Measurement. The principal measure employed 
was an adaptation of the conditioning score devised 
by Lykken (1955) and consisted of the difference 
between (a) the ratio of the average GSR to 
buzzer No. 1 on the first three presentations of that 
buzzer during extinction, divided by the average 
GSR to shock on the last three presentations of 
shock during conditioning and (5) the ratio of the 
average GSR to buzzer No. 1 on the last three 
adaptation trials, divided by the GSR to shock on 
the first three conditioning trials. 

Assessments of clinical anxiety. First, three 
clinical psychologists without knowledge of condi- 
tioning results independently categorized patients 
as high, medium, or low anxiety types by inspec- 
tion of psychology folders containing admission 
notes and routine admission tests, including the 
MMPI. No projective tests were available to the 
judges. Second, the Buss (1955) rating procedure 
was used by the psychiatrist in charge of the case 
who rated his patient on the basis of his brief 
admission interview. Finally, patients were sorted 
into three approximately equal groups on the basis 
of the MAS items from the MMPI. 


RESULTS AND DIscUSSION 


The relationships between the conditioning 
measure and the various anxiety measures are 
shown in Table 1. It can be seen that high, medium, 
and low anxiety groups as ranked by Clinician A, 
Clinician B, and the combined ratings of the three 
clinicians were significantly different in condition- 
ability ( < .05) by the Kruskal-Wallis Analysis 
of Variance by Ranks test (Siegel, 1956). Neither 





CRITIQUE AND NOTES 


TABLE 1 


Ratep ANxreTy OF Patients OrpERED AccoRDING TO CONDITIONING RANK 














| Kruskal-Wallis Test 





Anxiety Criterion 
Clinician A 
Clinician B 
Clinician C 
Combined Clini- | 

cians 

MAS | 

Psychiatric Rating | 

Diagnosis :> 


| os 
H 
lp | 


ZEK Rhine 


P 


12.00 
6.4% 
4.78 
8.77 


2.91 
17 








Zz REEE 

















Note.—H = High anxiety rating; M = Medium anxiety rating; L = Low 


* No MMPI taken by this patient. 
> N = neurotic; P = psychotic; C = character disorder. 


the rankings by the MAS nor the psychiatric 
ratings were significantly related to GSR condi- 
tioning. 

Spearman rank correlation coefficients (Siegel, 
1956) between basal resistance and the condi- 
tioning measure (r = —.22) and between basal 
resistance and the average shock response (r = 
—.04) were not significant, nor were there differ- 
ences in extinction between the various anxiety 
groups. 

It has been noted (Cronbach & Meehl, 1955) 
that while studying the validity of clinical con- 
structs, it is important also to scrutinize the 
criterion measures. The results suggest that 
ratings of clinical anxiety by experienced clinicians 
based on psychological tests and history data 
(including the MMPI) covary with conditioning 
of the GSR. There is a suggestion in the present 
investigation, as in other recent investigations 
(Goldberg, 1959), that studies employing judges 
should have some basis for inferring the expertness 
of the judges with the particular instruments. In 
the present instance, Clinician A, who was pre- 
sumed to be most competent and most experienced 
with the MMPI was apparently most successful, 
and Clinician C, who was presumed to be least 
competent and least experienced with the MMPI, 
was apparently least successful (See Tabie 1). 

Present findings are consistent with those of 
Bitterman and Holtzman (1952), in which judg- 
ments of anxiety by clinicians with several types of 
information available were significantly related to 
GSR conditioning, whereas the MAS was not. 

The psychiatric ratings based on clinical obser- 
vation from brief admission interviews were not 
significantly related to conditioning in the present 
instance, although in the Welch and Kubis study 
(1947) clear-cut relationships were obtained, 


possibly because the psychiatrists had greater 
familiarity with the patients. 

One factor which could in part account for the 
lack of significant differences in extinction between 
groups was revealed during posttest interviews. 
Some patients stated that during the long series of 
buzzers in the extinction trials they “expected” a 
few shock stimulations. Cook and Harris (1937) 
have shown that such a factor can override the 
effects of conditioning and extinction. 


SUMMARY 


Using presumably optimum GSR conditioning 
procedures with a one-half second US-CS interval 
and an ongoing sample of 19 psychiatric patients 
rated for clinical anxiety, (a) anxiety groups 
ranked on the basis of admission psychological data 
by clinical psychologists were found to be signifi- 
cantly different in conditionability; (5) anxiety 
groups ranked either on the basis of the Taylor 
Manifest Anxiety Scale or brief psychiatric admis- 
sion interviews were apparently not significantly 
different on conditioning; and (c) anxiety groups 
revealed no significant differences in extinction. 
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VERBAL REINFORCEMENT IN SCHIZOPHRENIA! 


EDWIN COHEN 
Veterans Administration Hospital, Iowa City, lowa 


awn BERTRAM D. COHEN? 
Lafayette Clinic, Detroit 


HE common clinical observation of impaired 

“reality contact” in schizophrenia implies 
a deficit in the intensity or direction of respon- 
siveness to social cues. Social cues, and specifically 
those involved in verbal interactions, have been 
shown to provide reinforcement effects in a wide 
variety of experimental settings (Salzinger, in 
press). If schizophrenic patients manifest faulty 
responsiveness to social cues, it may be predicted 
that the reinforcing function of listener reactions 
(i.e., verbal reinforcement) will be negligible for 
them as compared with nonschizophrenic control 
Ss. The present experiment was designed to test 
this prediction. The particular technique used 
was one with which reinforcement susceptibility 
had been previously demonstrated for general 
medical patients (Cohen, Kalish, Thurston, & 
Cohen, 1954) and for psychiatric patients selected 


! Portions of the present paper were read at the 
Midwestern Psychological Association meetings in 
St. Louis, Missouri, May, 1956. 

* At VA Hospital, Iowa City, Iowa, when study was 
conducted. 


on the basis of anxiety level scores rather than 
diagnosis (Taffel, 1955). 


METHOD 


Apparatus. The materials consisted of 160 3x5 
white index cards, on each of which was typed a 
common verb (used by Cohen et al., 1954). All 
verbs were in the past tense. Six pronouns—I, We, 
He, They, She, and You—were placed below each 
verb. The order of pronouns was randomized for 
the series of 160 cards. 

Subjects. Ss were 32 male hospitalized patients 
from the psychiatric service of the VA Hospital in 
Iowa City. All were capable of reading and free 
of CNS pathology. By unanimous agreement of 
the stafi—two psychiatrists and two clinical 
psychologists—half of the Ss had been diagnosed 
as schizophrenic, the other half as neurotic. Both 
groups were unselected as to subtype of illness and 
were comparable in age and education. Mean age 
of the schizophrenics was 32, while that of the 
neurotics was 32.4; mean education was 11.3 and 
10.5 years for the schizophrenics and neurotics 
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respectively. The groups varied significantly in 
the number of days they had been hospitalized; 
schizophrenics mean time was 130 days, with a 
range of 10 to 523, whereas the neurotics were 
hospitalized for a mean of 57 days with a range of 
18 to 166 days. 

Procedure. Each S was informed that he was 
participating in a research project. He was then 
shown a sample card and instructed to make up a 
sentence containing the verb on the card and 
beginning with any one of the pronouns. S then 
constructed sentences in this fashion until the 160 
cards were completed. E recorded each sentence 
verbatim. The cards were shuffled for each S 
to randomize the effect of order. 

The 16 Ss in each diagnostic category were 
randomly divided into two equal groups. One 
group in each diagnostic category was reinforced 
for any sentence begun with I or We, and the 
other group for similar use of He or They. During 
the acquisition phase (the first 80 cards), E said 
“Good” in a flat, unemotional tone following any 
sentence which S began with the pronouns chosen 
to be reinforced. In order to determine a base 
level without reinforcement, however, reinforce- 
ment was not administered until after the first 20 
card presentations. 

The last 80 presentations constituted the ex- 
tinction phase of the experiment. During this 
phase, reinforcement was discontinued for all Ss. 
In order to test S’s awareness of the relation 
between his responses and those of EZ, each S was 
asked at the completion of the series which pro- 
noun he had used most, as well as specific questions 
about his ideas concerning the purpose of the 
experiment and £’s responses. 


RESULTS 


In the treatment of the data, the number of 
reinforced pronouns occurring within 8 successive 
blocks of 20 cards each were used as the criterion 
measures, taking the first four blocks as the 
acquisition and the last four as the extinction 
phases. A summary of the analysis of the acquisi- 
tion trials is shown in Table 1. Neither Methods 
nor any of its interactions with other factors 
approaches reliable significance. It may therefore 
be safely assumed that the Ss conditioned equally 
to both classes of pronouns, I-We and He-They. 
The F values for Trials and Trials-by-Diagnosis 
are each significant at the 2% level. It seems, 
therefore, that conditioning did occur, but it 
occurred differentially for the two diagnostic 
groups. Since it appeared from inspection of Fig. 1 
that the curve for the schizophrenic Ss was not 
ascending, the data for this group alone were 
further analyzed. The two methods of conditioning 
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were combined, since the initial analysis had shown 
no differences attributable to methods. The 
F value for Trials was clearly not significant, 
allowing the inference that no reliable conditioning 
occuited for the schizophrenic group. It may then 
be concluded, because the over-all Trials effect 
for all groups is significant, that conditioning 
occurred in the neurotic group. 

Since the neurotic group had demonstrated 
acquisition, their data for Trial Blocks 4 through 
8 were analyzed for extinction. As suggested in 
Fig. 1, no reliably significant F was obtained for 
Trials. No evidence of extinction was found. 

Initial questioning of Ss with regard to the 
purpose of the experiment and the possibility that 
E had influenced their responses in any way re- 
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vealed no clear evidence of awareness of the rein- 
forcement contingency. Further questioning of a 
more specific nature, in which £ explicitly asked 
if his saying “Good” had any effect on S’s re- 
sponses, did result in “insight” statements by one 
schizophrenic and five neurotics. The acquisition 
scores for these Ss inicated no better conditioning 
than the scores for the remaining 26 who furnished 
no evidence of awareness. 


DISCUSSION 


The acquisition results confirm the hypothesis 
that reinforcement effects are generally negligible 
for schizophrenic patients. This held true for both 
response classes to which the reinforcer was 
applied (either first or third person pronouns). In 
contrast, the neurotic groups showed significant 
acquisition curves. Again, the response class made 
no difference. 

The combined acquisition curve for the two 
neurotic groups resembles that obtained for a 
group of nonpsychiatric patients reported in a 
previous paper (Cohen et al., 1954). The combined 
acquisition curve for the schizophrenic groups 
resembles that obtained for the nonreinforcement 
control group in the previous study. In other 
words, saying “Good” to the schizophrenic Ss 
was apparently functionally equivalent to saying 
nothing to nonschizophrenic Ss. 

The failure to demonstrate extinction as a con- 
sequence of nonreinforcement, as seen in the results 
for the schizophrenic group, is to have been ex- 
pected since no acquisition was demonstrated. On 
the other hand, acquisition was demonstrated by 
the neurotics, and yet extinction was not achieved 
under nonreinforcement. This finding is in accord 
with data for nonpsychiatric patients under a 
comparable extinction procedure (Cohen et al., 
1954). The possible explanation cited in that study, 
referring to the existence of continuing sources of 
reinforcement inherent in the procedure, also 
applies to this study. Another possible explanation 
involves the relatively low frequency of the 
reinforced response at the initiation of the extinc- 
tion procedure. It may be noted from Fig. 1 that 
the frequency of I and We for the neurotic Ss was 
less than 11 for 20 cards at the time extinction 
was initiated. Since a previous study (Cohen et 
al., 1954) continued the reinforcement procedure 
for another 80 cards without any increase in 
frequency of the reinforced pronoun class, the 
solution to increasing its frequency would not 
seem to lie in additional reinforcement trials prior 
to initiation of extinction procedures. One possible 
solution may be to decrease the number of possible 
different responses, thus raising the initial level 
and the final acquisition level as well. For example, 
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You and She could be eliminated. and I, We, He, 
and They retained. 

Certain limits on the generality of the present 
results, regarding the nonsusceptibility to rein- 
forcement of schizophrenic patients, should be 
specified. These include the limits set by (a) 
properties of the stimulus class employed as a 
reinforcer and (6) properties of the response class 
subjected to reinforcement operations. 

In the present experiment, the reinforcing 
stimulus (Good) was one that connotes social 
approval. £’s explicit responses were administered 
so as to provide reward for the desired response 
class rather than punishment of responses in a 
competing class. Results summarized by Rodnick 
and Garmezy (1957) suggest that schizophrenic 
Ss are especially sensitive to verbal punishment. 
It is possible that the aversive properties of verbal 
punishment could result in more normal acquisi- 
tion curves for schizophrenic Ss. A receut study by 
Leventhal (1958) provides data in support of this 
prediction. 


Studies of motor learning (Cohen, 1956), 


reaction time (Rosenbaum, Mackavey, & Grisell, 
1957), and operant behavior (Lindsley, 1956) of 
schizophrenic Ss support the hypothesis of Hunt 
and Cofer (1944) that psychological deficit in 
schizophrenia reflects a loss of social motivation 
based on a process of extinction of responsiveness 


to acquired incentives and rewards. If schizophre- 
nic patients have not incurred some irreversible 
impairment of the ability to respond to acquired 
reinforcers, one might predict that the experimen- 
tal pairing of a stimulus like the word Good with a 
primary reinforcer would result in its acquisition 
(or reaquisition?) of reinforcing properties. 

In an interview-type setting, Salzinger and 
Pisoni (in press) applied verbal reinforcing stimuli 
(Recognition and Agreement) to schizophrenic 
patients’ verbal responses of a “personal-affective” 
type. Significant operant acquisition effects were 
demonstrated. No changes were found in nonaffec- 
tive self-reference responses during the same 
interview. In the present experimental procedure, 
the reinforcer was applied to responses in the pre- 
arranged classes, regardless of any specific 
affective meanings the responses may have had for 
the individual patients. 

A study by Hartman (1955) suggests that the 
initial (operant level) response subjected to rein- 
forcement operations may be of particular impor- 
tance for schizophrenic patients. Using a technique 
similar to that of the present study, Hartman 
subdivided his Ss in accordance with the initial 
levels of preference they showed for the to-be- 
reinforced pronoun class. For his schizophrenic Ss, 
he found significant acquisition effects for the 
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subgroup which showed a high initial preference 
for the reinforced pronoun class. In the present 
study, reinforcement was administered to responses 
without regard for prereinforcement preferences. 

Impaired susceptibility to verbal reinforcement, 
in view of the present results, warrants considera- 
tion as an objective, experimentally manipulable 
correlate of schizophrenic behavior. It is recognized 
that many investigators conceive of schizophrenic 
impairment of responsiveness to social cues (e.g., 
disturbances in reality contact) in terms of changes 
in perceptual and cognitive variables. Studies can 
be designed to clarify the relations of these vari- 
ables to those of verbal reinforcement. In any 
event, further systematic investigation of the 
various parameters of verbal reinforcement in 
normal and pathological groups appears both 
warranted and promising. 
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EGO DISJUNCTION: 
A FAILURE TO REPLICATE TREHUB’S RESULTS 


WILBERT E. FORDYCE 
Veterans Administration Hospital, Seattle, Washington 


AnD WILLIAM R. CROW 
University of Washington 


REHUB (in press) introduced the concept of 

ego disjunction (ED), defining it as “the 
concurrence in an individual of two or more rela- 
tively strong, relatively long enduring needs or 
characterological dispositions with mutually in- 
compatible objectives.” ED was measured by the 
Edwards Personal Preference Schedule (EPPS) 
Edwards, 1954), on which seven pairs of a prior 
incompatible needs were taken to indicate the 
level of ED. Trehub found a monotonic linear 
relationship between ED scores on the EPPS and 
diagnostic category, ranging from low ED for 


college students, up through adolescents, neurotics, 
character disorders, and schizophrenics. The last 
three groups were patients at a VA Neuropsychia- 
tric hospital. 

This study replicated that part of Trehub’s 
study involving patient groups. In addition to the 
ED measure, a series of quantitative indices of 
disorganization based on the MMPI were com- 
pared with ED scores and with the diagnostic 
criterion. MMPI measures used (all expressed in T 
scores) were Pt-Sc plus 100 (Delta); Sc plus 2 
(Se-Pt), with Sc scores above 90 considered as 90 
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(Gamma); Pa plus Pt plus Sc (Triad); and the 
39-item form of the Edwards (1953) Social De- 
sirability scale (SD).* 

Initially the diagnostic groups studied were 
drawn from a population of adult, male, veteran, 
neuropsychiatric patients at the VA hospital 
(GM and S), Seattle, Washington. There were 20 
acute schizophrenics (Ac-Sc), 31 character dis- 
orders (Ch-Dis), and 22 neurotics (Neu) as classi- 
fied by a single, final diagnosis assigned by the 
hospital staff. No cases with multiple psychiatric 
diagnoses were used. Later, a further sampling of 
more chronic schizophrenic patients was made to 
extend the range of severity of disorganization 
tapped by the tests. Under the same criterion of 
final hospital diagnosis, chronic male schizophre- 
nics (Ch-Sc) were added—21 from Western State 
Hospital, Steilacoom, Washington, and 10 from 
the VA hospital (NP), American Lake, Washing- 
ton.? These two subsamples were combined. 

All subjects performed on the EPPS and the 
booklet form of the MMPI either the same day or 
within a very few days of each other. 


RESULTS 


The data failed to duplicate Trehub’s findings. 

Table 1 shows the means and standard devia- 
tions on ED and the MMPI scores for each 
diagnostic group. Table 2 shows the ¢ test compari- 
sons of the means, indicating no differences 
between any of the criterion groups on the ED 


The Delta and Gamma scores were defined arbi- 
trarily and are designed to treat differently the shape 
and elevation factors of the Pt and Sc scales. 

*We are indebted to F. E. Shoviain, Superin- 
tendent; R. W. Brown, Clinical Director; and Mack F. 
Knutsen, Supervising Psychologist, all of Western 
State Hospital; and to Leon S. Diamond, Chief of 
Professional Services and John Marks, Chief, Psy- 
chiatric Research Unit, VA hospital, American Lake, 
Washington, for their help in providing data for this 
study. 


measure. In contrast, the MMPI indices of dis- 
organization more consistently discriminate the 
schizophrenic groups from the character disorder 
group, though not from the neurotic group. 


Discussion 


The failure of the ED measure to discriminate 
schizophrenic patients from the character disorder 
or neurotic patient groups raises a serious question 
as to the validity of the EPPS measure of the ego 
disjunction construct. While no adolescent or 
college student groups were included in this study, 
the range of presumed levels of ego disjunction 
represented in the four patient groups studied 
should be considerable. The success of the MMPI 
indices in discriminating the groups lends support 
to the inference that a range of sickness levels or 
ego disjunction does exist in the groups studied 
There was no criterion contamination in the 
chronic schizophrenic group because the MMPI 
was administered after diagnoses were made. In 
the other groups, the MMPI as a part of the 
diagnostic workup process may have contributed 
in varying amounts to final diagnosis. Criterion 
contamination could account for differences found 
between neurotic and character disorder groups 
and the acute schizophrenic zroup. It is obvious, 
however, that far more than the MMPI was in- 
volved in arriving at a final diagnosis. 

Failure of ED to discriminate when the MMPI 
indices succeeded suggests that sampling differ- 
ences may have influenced Trehub’s original 
findings. Differences in diagnostic criteria may 
have entered into these conflicting findings. It 
seems possible that cultural differences in test 
behavior between patients from New England and 
from the Pacific Northwest may contribute to the 
different findings of the two studies. 

The fair success of the MMPI measures used 
here in discriminating psychotics from nonpsychot- 
ics offers some promise as a quantitative, objective 
criterion for use in other studies. 
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TABLE 2 


DISCRIMINATION OF D1aGNnostic Groups By / Tests on ED 


AND MMPI Inpices 
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SUMMARY 

Trehub’s EEP-derived ego disjunction measure 
was compared with a series of MMPI-derived 
measures of ego disorganization on groups classified 
by psychiatric diagnosis. In contrast to Trehub’s 
original study, the ego disjunction measure failed 
to discriminate among neurotics, character dis- 
orders, acute schizophrenics, and chronic schizo- 
phrenics. All MMPI measures except the Social 
Desirability scale discriminated schizophrenics 
from nonschizophrenics. 

It was concluded that sampling differences, 
differences in diagnostic practice, or cultural 
differences in patient groups studied could account 
for the conflicting findings of this study and the 
original work by Trehub. 
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* Correction for unequal variances made by calculating d/ as N — 1 of the smaller group. 


Success of the MMPI indices in discriminating 
psychotics from nonpsychotics offers promise of 
the utility of these indices in other studies. 
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A NOTE CONCERNING HOMOGENEITY OF MEMBERSHIP AND GROUP PROBLEM 
SOLVING 


MARVIN E. SHAW 
Massachusetts Institute of Technology 


He intivia of group members with regard 


to individual differences has long been 
recognized as an important variable in group 
behavior (Hemphill, 1950). In a recent article, 
Hoffman (1959) reports results supporting the 
hypothesis that homogeneity results in less effi- 
cient problem solving than does nonhomogeneity 
of membership. Citing the findings of Pelz (1956) 
to support his view, he suggests that “... the 
results reported in the present study are probably 
generalizable well beyond the limited population 
of college students who supplied the data.” (Hoff- 


man, 1958, p. 31). It was this statement which 
motivated the writer to report some of his own 
observations of relations between homogeneity 
and group effectiveness." 

During the past few years, the writer has 
conducted a series of experiments attempting to 
study the joint effects of group structure and 


' The purpose of this note is not to criticize Hoff- 
man’s excellent report, but only to offer a word of 
caution regarding the generalizability of the results and 
a word of encouragement to others who may be inter- 
ested in this problem. 
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TABLE 1 
Tue RELATION OF MemBer HomoGENEITY TO GROUP PERFORMANCE AND SATISFACTION 








Homogeneity Variables Structural Conditions 


Group Process Variables 
Time Scores 





Trials to Target 


Ratings of 
Satisfaction 





Intelligence (Shaw, Power: 
1959a) Centralized 
Decentralized 
Communication: 
Centralized 
Decentralized 
Power: 
Centralized 
Decentralized 
Communication: 
Centralized 
Decentralized 
Power: 
Centralized 
Decentralized 


Acceptance of Author- 
ity (Shaw, 1959a) 


Individual Prominence 
(Shaw, 1959b) 


20 
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—.11 


—.18 
+.20 


em ah. 
88 25 


22 
22 


i+ + 
55 





Note.—Signs of correlations have been adjusted so that a positive correlation indicates increasing efficiency or satisfaction with in- 


creasing homogeneity. 
* 9 < 0S, 
** 9 < 01. 


composition upon group efficiency in problem 
solving situations. (Shaw, 1959a, 1959b). While 
not related to the primary purpose of the investi- 
gations, the data were analyzed systematically 
to determine the relationship between various 
estimates of homogeneity and problem solving 
effectiveness under varying structural conditions. 
The results of these analyses are given in Table 
1. The measure of homogeneity used in computing 
the correlations reported was the average deviation 
of scores earned by members of the group on 
measures of intelligence (the verbal scale of the 
Scholastic Aptitude Tests), acceptance of au- 
thority, and individual prominence. Essentially 
the same results were obtained, however, when 
other indices of homogeneity were used, such as 
range and standard deviation. Measures of group 
process variables were number of trials to criterion, 
total time required by the group to complete the 
assigned task, and average ratings of satisfaction 
by group members 

A number of comments mer be made in con- 
nection with the results reported in Table 1. First, 
there seems to be no systematic relation between 
homogeneity and group efficiency and satisfaction. 
Degree and direction of correlations vary with the 
group structure and the particular characteristic 
under consideration. 

Second, most of the correlations fail to meet 
the usual requirements for statistical significance. 
This may be due to the small number of cases in 


each sample, to the fact that the homogeneity 
estimate is based upon a single characteristic 
rather than a profile as were Hoffman’s measures, 
or to the fact that no relation exists. It is interesting 
to note, however, that the one condition yielding 
uniformly significant correlations (acceptance 
of authority in a decentralized power structure) 
is precisely the one where such a relation might 
be expected on a priori grounds. 

Third, centralized structures appear consistently 
to produce correlations that differ from those 
found in decentralized structures, although these 
differences are mostly unreliable. 

These findings point to the difficulties in 
generalizing the results of a single study or even 
several studies beyond the limits imposed by the 
experimental situations. It may be, of course, 
that the effects of profile homogeneity will prove 
to be unchanged by such extraneous variables as 
group structure, group task, and the like; however, 
it seems far more likely that the effects of homo- 
geneity, like so many other variables in group 
behavior, will be found to vary with the particular 
conditions under which its effects are measured. 

The general problem of homogeneity as a 
variable in group behavior is a tremendously 
important one and deserves extensive investiga- 
tion. The limited evidence reported here, along 
with the findings of Hoffman (1959) and Pelz 
(1956), indicates that the results of such studies 
will richly repay the investigator for his efforts. 
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THE BODY IMAGE BARRIER CONCEPT' 


CHARLES R. EIGENBRODE anp WILLIAM G. SHIPMAN 
Staunton Clinic, School of Medicine, University of Pittsburgh 


coop deal of the recent research on the body 

image has been done by Fisher and Cleve- 
land. Their initial study of rheumatoid arthritics 
(Cleveland & Fisher, 1954) led them to focus on 
the external limits of the body image, which may 
serve unconsciously as a defensive barrier to ward 
off threats from the outside and to hold disturbing, 
hostile impulses within. A scoring system for 
Rorschach protocols was derived which yields a 
Barrier score—an index of the degree of concern a 
subject has about his body limits. This measure 
was validated on a group of patients with internal 
and external disorders (Fisher & Cleveland, 1955), 
then applied with interesting findings to cancer 
patients (Fisher & Cleveland, 1956b), to style of 
life (Fisher & Cleveland, 1956a), and to physio- 
logical reactivity (Fisher & Cleveland, 1958; 
Fisher, 1959). It has been assumed in these papers 
that the measure represents a psychological factor 
that is not only a strong determinant of locus of 
disease but is related to central personality 
tendencies. The purpose of the present study was 
to determine how well the Barrier score differen- 
tiates between patients with internal and 
external psychosomatic illnesses. 


METHOD 


Sample. The Rorschach records used in this 
study were obtained from the case files of Staunton 
Clinic. The medical record of each patient was 
checked so that assignment to one of the experi- 
mental groups could be made. No patient was 
included in the sample who (c) had symptoms of 
both internal and external disease, (6) was below 


1 The authors wish to thank Seymour Fisher and 
Stanley King for personally clarifying the scoring 
system. Our associate, Leslie Berger, made many 
helpful suggestions in the latter stages of the project. 


16 or above 50 years of age, or (c) had less than 10 
responses in his Rorschach. Cases were drawn so 
that the two patient groups were similar in age, 
education, and IQ. Two groups of records were 
obtained: 54 cases of psychosomatic skin disorders 
(26 atopic eczema, 16 chronic urticaria, and 12 
miscellaneous skin disorders) and 29 cases of 
internal psychosomatic disorders (18 stomach 
ulcer, 8 ulcerative colitis, and 3 genitourinary 
disease). Emotional factors were considered by the 
physicians to be the major factor in all skin 
cases. 

Scoring. After learning the scoring system, the 
authors collaboratively scored the 83 records, 
which had been randomly mixed and from which all 
primary identification material had been removed. 
The number of scored responses was equated for 
the two groups by setting a maximum of 20 scored 
responses per record. This control was successful 
in that the median number of responses per record 
in both groups was 20. After three to five days, 30 
of the 83 records were pulled at random and re- 
scored as a check on the consistency with which 
the scoring system had been applied. This com- 
parison yielded a Peerson correlation of .95. At 
this point, Fisher was consulted. His scoring of 10 
of these records yielded a Pearson r of .84 with 
that of the authors. This consultation indicated 
that several basic changes in the scoring system 
used by the authors needed to be made in order to 
coincide more closely with Fisher and Cleveland’s 
actual practice. This prompted a rescoring of the 83 
records by the two authors individually. The 
extent of their agreement was indicated by a 
Pearson r of .80, which compares favorably with 
an r of .82 reported in the original monograph 
(Fisher & Cleveland, 1955). 
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RESULTS AND DISCUSSION 


A nonparametric median test was applied to the 
Barrier score data of both of the writers fer the 
interval vs. the external disease groups. Neither 
of the obtained chi squares approached signifi- 
cance. Indeed, the obtained frequencies were 
almost identical with those expected by chance. 
In the light of this absence of difference in the 
Barrier scores obtained by patients with internal 
and external diseases, it seems doubtful that a 
concern with firmness of body boundaries plays a 
significant role in determining the location of an 
illness in the body. 

Where does the reason for this discrepant 
finding lie: in the sample, in the measuring 
instrument, in the theory, or in some combination? 

The sample differs from that used in the original 
study mainly in disease distribution anc sex 
ratio. In this study, the sex ratio for the various 
diseases was kept similar to the known distribution 
of the disorders in the general population. But sex 
differences should not be an important source of 
discrepancy since both Fisher and Cleveland and 
the present investigators found no sex difference in 
the Barrier score. 

The scoring system may have been misapplied 
or have intrinsic weaknesses that allow different 
results in different studies. The correlation of .84 
between the initial scoring of the writers and 
Fisher, plus the subsequent rescoring, suggests 
that reliable scoring was not a problem. This turns 
the focus on the Barrier score itself, which seems 
open to considerable criticism. The scoring rules, 
in detail, have not been published. No retest 
reliability has been reported. And most important 
of all, no cross-validation of the scale seems to have 
been published by other investigators. The small 
size of the modal Barrier score, 4, and the range, 
0 to 12 in this study, aliows great instability of 
scores. Thus, a subject’s classification as “high” 
or “low” may also be unstable. 

This highlights another variable, the examiner. 
There is a great range in the amount of inquiry 
that is done during the administration of a 
Rorschach, and the presence or absence of addi- 
tional descriptive material most certainly affects 
the opportunity for Barrier scores. To check on 
Barrier score differences between examiners, a 
nonparametric analysis of variance by ranks was 
done on the scores obtained by the seven examiners 
who tested 78% of the cases used here. The 
resultant Kruskal-Wallis H was statistically 
nonsignificant, and it was concluded, therefore, 
that differences between examiners did not cause 
significant differences in the number of Barrier 
responses given by the subjects in this sample. 

Conceivably, the scale does not measure the 
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Barrier concept very well. In many of the major 
scoring categories there is a great deal of apparent 
arbitrariness about which Rorschach responses fit- 
ting the category may be scored. For example, when 
considering animals with unusual skins, a mink is 
scored but a possum is not. Other inconsistencies 
emerge in the “unusual skin” category which may 
contribute a great deal to the total. And why is not 
the opposite end of.the dimension scored? One 
would think that minus scores for such soft 
surfaced things as sponges and wool would im- 
prove the scoring. Neglecting to score responses at 
one end of the continuum shouid hypothetically 
greatly reduce the sensitivity of the measure. 
These various points lead one to conclude that 
there are some striking gaps in the application of 
the Barrier principle to the scoring system. 

The theory itself warrants careful inspection. 
The scoring instructions refer to “an individual’s 
perception of his body boundaries as definite and as 
possessing protective properties.” There are ob- 
viously two features involved here, and it is a 
question whether these two features, definiteness 
and protectiveness, are necessarily related. A 
person may be well aware of his limits but not 
necessarily fearful of attack and defensively 
oriented. Presumably, body limits are clearer with 
emotional maturity, whereas a preoccupation with 
defense to external threat occurs at a less mature 
level. Similarly, the Barrier theory scores with 
equal weight, containers and defensive covering. 
Though these both involve a hard, encircling 
surface, the points of view involved seem quite 
different. One wonders if these two concepts 
really represent only one psychological variable or 
whether there are at least two factors involved. 
The point, then, is that the Barrier concept may 
entail different and not entirely consistent dimen- 
sions. This complexity of the over-all measure 
would be acceptable if the role of the subfactors 
were known and the measure correlated success- 
fully with other variables. But under the present 
circumstances, some of the subcategories may be 
measuring the Barrier tendency while others, which 
do not, cancel the relation when the subcategories 
are combined. 

SUMMARY 

A check was made on whether the Rorschach 
Barrier scores of 54 patients with external psycho- 
somatic disorders were greater than those of 29 
patients with internal psychosomatic disorders. 
Statistical aralysis showed no evidence of differ- 
ence in the distribution of Barrier scores in the 
two groups. “he potentialities of the Barrier 
approach car:aot be truly assessed until ambiguities 
in the theory are clarified and until the scale is 
improved as a measuring instrument. 
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